Constructing holographic spacetimes using entanglement renormalization 
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We elaborate on our earlier proposal connecting entanglement renormalization and holographic duality in 
which we argued that a tensor network can be reinterpreted as a kind of skeleton for an emergent holographic 
space. Here we address the question of the large iV limit where on the holographic side the gravity theory 
becomes classical and a non-fluctuating smooth spacetime description emerges. We show how a number of 
features of holographic duality in the large N limit emerge naturally from entanglement renormalization, in- 
cluding a classical spacetime generated by entanglement, a sparse spectrum of operator dimensions, and phase 
transitions in mutual information. We also address questions related to bulk locality below the AdS radius, 
holographic duals of weakly coupled large N theories, Fermi surfaces in holography, and the holographic inter- 
pretation of branching MERA. Some of our considerations are inspired by the idea of quantum expanders which 
are generalized quantum transformations that add a definite amount of entropy to most states. Since we identify 
entanglement with geometry, we thus argue that classical spacetime may be built from quantum expanders (or 
something like them). 



I. INTRODUCTION 

A few years ago we proposed a connection between two 
new but superficially quite different approaches to quantum 
many-body physics, entanglement renormalization and holo- 
graphic duality QjJ] . Entanglement renormalization is moti- 
vated by considerations of entanglement in quantum matter 
while holographic duality emerges out of string theory, but 
both frameworks organize information in a quantum many- 
body system as a function of length scale according to the 
renormalization group. After our initial proposal, a substan- 
tial body of work appeared supporting and extending the orig- 
inal idea l2T-[l71l. Related work attempting to give formal 
holographic duals of generic field theories appeared in Refs. 
IU2T414I1 . The connection between entanglement and geome- 
try has also been studied in Ref. fisl) . In this paper we es- 
tablish firmer contact between entanglement renormalization 
and holographic duality by considering entanglement renor- 
malization in systems with many local degrees of freedom. 
On the holographic side, many intriguing features emerge in 
the large A^ limit including a classical geometry, entropy pro- 
portional to area, and a peculiar spectrum of operator scaling 
dimensions lma - 12011 . It is our goal to show how all these fea- 
tures and more emerge naturally from entanglement renormal- 
ization. 

Such a connection between entanglement renormalization 
and holographic duality is important. Entanglement renormal- 
ization sheds light on the microstructure of holographic space- 
time and gives us a guide to holographic duality away from 
large A^ and strong coupling. Thus we can learn about the 
physics of quantum gravity using tools of quantum informa- 
tion science and quantum many-body physics. Holography, 
which appears to encode entanglement so naturally, can also 
teach us about quantum matter. Indeed, two of the key prob- 
lems of quantum matter are the understanding of many-body 
entanglement and the development of general entanglement 
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based simulation tools. We already know that entanglement 
renormalization can solve the one dimensional quantum Ising 
chain, so if it can also handle super-symmetric gauge theory 
in three dimensions then surely it can handle anything. More 
practically, holography inspires new perspectives on entangle- 
ment renormalization including extensions to time dependent 
systems. As these two subjects are brought closer together, 
the exchange should enrich both disciplines. 

To begin we must understand the physical content of en- 
tanglement renormalization and holographic duality. Entan- 
glement renormalization [,21] is a real space renormalization 
group whereby quantum states, often ground states, are rep- 
resented via a hierarchical tensor network structure. This net- 
work geometry is motivated by the structure of entanglement 
in local quantum systems, with the physical picture being that 
we remove short range entanglement, coarse grain, and then 
repeat at a longer length scale. We always only remove local 
entanglement at each step, so any long range entanglement is 
preserved deep into the infrared (IR) where it gives a clean 
characteristic of the quantum phase. Entanglement renormal- 
ization can also be viewed as a variational state, known as 
the multi-scale entanglement renormalization ansatz (MERA) 
ll22ll . with a structure dictated by the removal of local entan- 
glement followed by coarse-graining. Such a formulation has 
a number of desirable features including efficient computation 
of correlators, efficient encoding of operator dimensions, and 
a simple representation of entanglement. 

Holographic duality arose out of string theory iflrjUTctl 
but is now understood as being considerably more general 
||20[ l23ll . The duality states that an ordinary quantum field 
theory in d + 1 dimensions is exactly equivalent to a theory 
of quantum gravity in a higher dimensional bulk geometry. 
Every observable in the field theory has a corresponding rep- 
resentation in the gravity theory; for example, black holes in 
gravity are dual to finite temperature states in the field the- 
ory. The best known example is the duality between Af = 4 
SU(N) Yang-Mills theory in 3 + 1 dimensions and type IIB 
string theory in an asymptotically AdS, 5 x S 5 spacetime If24ll . 
Ignoring the sphere (it is related to a large symmetry of the 
field theory), the extra holographic dimension in AdS^ is re- 
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lated to renormalization group scale in the dual field theory. 
Indeed, the ultraviolet (UV) of the field theory lives at the 
conformal boundary of AdS while the IR of the field theory 
lives deep inside the AdS space. 

Our proposal is that these two frameworks, entanglement 
renormalization and holographic duality, are really two ver- 
sions of the same basic structure. In Ref. Oil we identi- 
fied the emergent holographic direction with the number of 
coarse graining steps in entanglement renormalization. We 
also showed how the resulting network describing entangle- 
ment renormalization could be interpreted as a discrete ap- 
proximation to AdS where degree of quantum correlation, as 
measured by entanglement entropy S = — tx(p In (p)), was 
encoded geometrically in a metric ds 2 . In short, ds 2 = dS 2 
(literally true in d = 1), so that the holographic geometry is 
built out of entanglement. Our precise results included estab- 
lishing a bound on the entropy in terms of discrete minimal 
curves, giving a geometric picture of correlation functions in 
terms of geodesies, and showing how black hole-like objects 
appeared at finite temperature. 

In this paper we study entanglement renormalization in the 
context of large N models, which are models with a large 
number of local degrees of freedom. These include vector 
models like the 0(N) model 112511 and matrix models based 
on a large SU (N) gauge group 12411 . We will primarily focus 
on the matrix model case, for reasons that will become clear 
as we go, but we will also discuss the situation with vector 
models. Our primary goals include understanding the emer- 
gence of classical geometry and the peculiar spectrum of op- 
erator scaling dimensions in strongly coupled matrix theories 
ll20ll . However, we will also obtain a number of other salient 
features of the duality at large N and strong coupling. 

From the perspective of entanglement as geometry, roughly 
what we want is a quantum circuit or RG transformation that 
adds a definite amount of entanglement. It turns out that there 
is already such a structure in quantum information science 
known as a quantum expander lf26l l27ll . Hence we want to 
argue that classical space is made of quantum expanders (or 
something like them). A quantum expander may be under- 
stood as a generalized transformation of a quantum state (den- 
sity matrix) that adds a definite amount of entropy for almost 
all inputs. Although we will not make precise use of the quan- 
tum information notion, it is an important motivation which 
we review below. 

Now for a preview of our main results. We argue that, at 
large N and strong coupling, the tensors of the tensor net- 
work will be strongly mixing which implies both the addition 
of a definite entropy and the largeness of most operator di- 
mensions. In this context, we point out the potential relevance 
of quantum expanders and related notions to spacetime ge- 
ometry. We also derive the phase transition structure of holo- 
graphic mutual information from entanglement renormaliza- 
tion, and furthermore, we show how to compute sub-leading 
corrections. We show how to obtain various n-point functions 
from the geometric structure of entanglement renormalization 
and point out a similarity to Witten diagrams. We also give a 
holographic interpretation of the branching MERA and a new 
perspective on Fermi surfaces that should have holographic 



ramifications. Finally, we recast our considerations in the con- 
text of continuous entanglement renormalization, discuss the 
idea of RG causality, elaborate on our black hole-like objects, 
and obtain recent holographic metrics with logarithmic entan- 
glement entropy from a field theory construction. 

This is a long paper, so we indicate where the main results 
are located. Sec. II is the toolbox where we review certain 
key ideas including conformal symmetry, large N field theo- 
ries, entanglement and entanglement renormalization, and the 
idea of quantum expanders. Only the entanglement renormal- 
ization section is really crucial for what comes after. Sec. Ill 
contains the first of our main arguments in the context of large 
N entanglement renormalization on a lattice. There we dis- 
cuss operator dimensions, the emergence of area, the struc- 
ture of mutual information and correlations, the idea of RG 
causality, and some ideas on Fermi surfaces. Sec. IV is a 
shorter section in which we compare the results of Sec. Ill to 
holography proper. In Sec. V we reformulate our results in the 
context of continuous MERA. There we discuss entanglement 
and correlations, the idea of RG causality, black hole-like ob- 
jects, the idea of entanglement per scale, and a field theory 
construction of holographic metrics supporting area law vio- 
lations. Finally, we conclude the paper with a discussion of 
numerous open issues and future directions. 



II. TOOLBOX 

Here we briefly review some background information nec- 
essary for the arguments in the paper. We begin with scale 
invariance in quantum field theory with a special emphasis on 
conformal symmetry. This symmetry is important because a 
critical part of our discussion concerns primary (scaling) oper- 
ators and their scaling dimensions. Then we discuss the large 
N limit in field theory and describe some of the systems of rel- 
evance to us. Next we review entanglement renormalization 
and the proposal in Ref. [Q]]. Finally, we review the notion 
of quantum expanders and random unitaries and briefly dis- 
cuss their role in inspiring our ideas. Readers familiar with 
some of this material are encouraged to skip forward to the 
new arguments in Sec. III. 



A. Conformal symmetry 

We will be primarily interested in scale invariant theories. 
In the relativistic case these often turn out to also be confor- 
mally invariant (see Ref. J28ll for a friendly introduction). We 
take the metric of the spacetime in which the field theory lives 
to have g u < 0. For example, in two dimensional flat space 
we would have ds 2 = gudt 2 + g xx dx 2 with g u = —1 and 
g xx = 1. Conformal transformations as those that preserve 
the form of the metric up to rescaling and are a generalization 
of rotations, translations, and scaling transformations or di- 
latations, d always refers to the spatial dimension of the field 
theory. 

The conformal group is generated by the momentum (trans- 
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lations), 



the angular momentum (rotations/boosts), 

M^ v = i{x^d v -x v dfj,), 
the dilatation generator (dilatations), 

D = -ix"d ut 



(2.1) 



(2.2) 



(2.3) 



and the special conformal generators (special conformal trans- 
formations), 



K„ = -i(x ■ xd,. 



d). 



(2.4) 



We can easily check, for example, that the translation operator 
T(a) = e laP changes x as 

Tiajx^T- 1 ^) = e^^x^e-^ 9 " = x^ + (2.5) 

while the dilatation generator gives 

e lbD x l "e~ lbD = bx". (2.6) 

More generally, in a field theory with conformal symmetry 
(CFT) we can relate all these generators to the stress tensor 
T Mi , as follows: 

P* = I d d xT fi \ 



M" v = - J d d x {x»T vt - x v T^) , 
D = J d d x(x^ t ) 1 

I d d x (x ■ xT^ - x»x v T vt ) . (2.7) 



The expression for P^ gives insight into the conventional 
physics of the stress tensor. T tt or T 00 is the energy density 
while T M is the momentum density in the x % -direction. Re- 
lated to each generator is a conserved current. For example, 



the dilatation current is Jp = x v T vli so that 



D = I d d xJ t 



and 



(2.8) 



(2.9) 



where the last equality follows because the stress tensor is 
conserved and traceless (in a CFT). 

Besides the usual Lorentz algebra, the important new com- 
mutation relations involve the dilatation and special conformal 
generators. These relations include 

[D,P fi ]=iP ti , 



\D,Kn 



-%K„ 



[K fl ,P v ] = 2i(g fil/ D-M^), 



(2.10) 



in addition to the statement that D is a scalar and is a vec- 
tor under rotations. Since the momentum has scaling (mass) 
dimension one, we expect that an operator O with definite 
scaling dimension A will satisfy [D, O] = iAO. The [D, K^] 
commutator then implies that lowers the dimension of any 
operator on which is acts, that is 

[D, [Kp, O]] = -[K M , [O, D)] - [O, [D, K M ]] 



-[K^-iAOl-fo-iKp] 
i(A-l)[K M ,0]. 



(2.11) 



Since the scaling dimension of sensible operators must be pos- 
itive we see that the special conformal transformations must 
annihilate some fields, called primary fields. 

As an aside, special conformal transformations may be un- 
familiar to some readers. They can be understood as the fol- 
lowing three step process. First, we invert all coordinates 



x" -> — , 

x 

then we translate the inverted coordinates 



x* 1 



.(■'' 
T2" 



ar ar 

and finally we invert the coordinates again 



(2.12) 



(2.13) 



x 2 b» 



x~ 2 + 2b ■ x/x 2 +b 2 l + 2b-x + b 2 x 2 ' 



(2.14) 

Although we tend to focus on simple scaling transformations, 
these more complex special conformal transformations also 
provide important constraints on correlation functions. Addi- 
tionally, they play an important role in the entanglement en- 
tropy of spheres in CFTs lf29ll . 

Finally, let us mention the concept of an operator product 
expansion. This permits us to make sense of products of scal- 
ing operators in the continuum theory. When two scaling op- 
erators are brought close together, a singularity will develop 
which is cutoff by the lattice spacing. In the continuum we 
may write 

Oi{x)Oj{y) ~ Y,°U X - v)°k(.y) (2-15) 



where the operator product expansion (OPE) coefficients ck 
are singular as x approaches y. If we suppress spin labels then 
the OPE coefficients have the form 



4(x-y) = c% 



1 



\ x _ y ] j A i +Ai-A k 



(2.16) 
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which is indeed singular for small enough A^. The meaning 
of this expansion is that it becomes an equality when inserted 
into correlation functions with no other nearby operators. The 
fields Ok appearing on the right hand side are called fusion 
products of the fusion of Oi and Oj. The OPE coefficient 
for the identity Ok = 1 is simply the two-point of the two 
operators. More generally, the full set of OPE coefficients and 
operator dimensions are a crucial part of the data defining a 
conformal field theory (see for example Ref. 1I20I1 ). 



B. Large TV limit 

The simplest example of a large TV theory is provided by 
the vector model with O(Nf) symmetry. Here TV = Nf and 
the basic fields are Nf component vectors (f) a . The simplest 
Lagrangian incorporating this field is provided by the free the- 
ory 

C = \((d$?- m 2 ($f). (2.17) 

Correlators obey (<j> a <t>b) = Sab, so the theory is equivalent to 
TV copies of a free scalar field. We can add interactions to C 
via a quartic term as follows 

C = \ {{d$f - m 2 (0) 2 ) - 9i (($) 2 ) 2 . (2.18) 

Note however that the last term contains roughly TV 2 terms 
while the first terms both contain TV terms, so we see that to 
keep the action of order TV as TV — > 00 174 must be chosen 
so that c/4 = TV #4 has a finite limit. In this limit, the whole 
action becomes large and the theory is solvable in a saddle 
point approximation. It will be important later that the number 
of low dimension primary fields diverge as TV —> 00 i.e. (j> a is 
primary for all a = 1, TV. 

Many interesting phenomena are visible in the limit TV — > 
00, including symmetry breaking and a non-mean field transi- 
tion into the symmetry broken phase i25ll . The quantum phase 
transition to the symmetry broken state occurs at a particular 
value of g 4 and is described by an interacting conformal field 
theory. Nevertheless, the spectrum of primary operators in the 
large TV limit looks much like the spectrum of the free theory 
in the massless limit m 2 = 0. Finally, at the level of a micro- 
scopic lattice model, there are numerous simple rotor and spin 
models that have the above theory as a low energy limit. 

A more interesting kind of large TV limit is provided by 
gauged matrix models l24ll . In these models we consider 
a gauge group like SU(N C ) and consider theories of gauge 
fields A coupled to matter in various representations of the 
gauge group. The gauge field itself is in the adjoint represen- 
tation of the gauge group, which means that it may be under- 
stood as an TV C x TV C matrix. Similarly, the matter fields <E> 
may also be in the adjoint representation in which case they 
are also TV C x TV C matrices. To construct gauge invariant op- 
erators, we need only multiply such matrix operators and take 
a trace since adjoint fields transform like $ — > U C $U% with 
U c G SU (TV C ). These gauge models, which are referred to as 



matrix models, have TV = TV 2 and are the most directly rele- 
vant to holographic duality. Typical operators in these models 
are traces (over the matrix degrees of freedom) of products of 
adjoint fields e.g. tr(<E> 2 ) or tr(F 2 ) where F ~ dA + [A, A] is 
the field strength of A. 

A typical Lagrangian for a matrix model takes the form 

C = 4rtr ((L>$) 2 + m 2 $ 2 + $ 3 + $ 4 + ...) (2.19) 
9 

in terms of a matrix field $ and its covariant derivative = 
d$+[A, <£] Jl^]. If werescale <E>/g -> $ it becomes clear that 
g is a coupling constant. In this case we have TV ~ TV 2 and 
thus we must again tune g to zero as as TV gets large to have a 
sensible theory. The correct large TV coupling turns out to be 
A = ,g 2 TV c = g 2 N x / 2 which is called the 't Hooft coupling. 
It would take us too far afield to properly derive this result, so 
we restrict ourselves to a few comments. 

First, in the large TV limit perturbation theory simplifies so 
that only so-called planar diagrams survive. The planar dia- 
grams are then organized into a perturbation series in powers 
of A with all other non-planar diagrams suppressed by powers 
of 1/TV. Second, one simple way to understand the scaling of 
A is to look at the RG (3 function of the coupling. Typically 
we have 

d u g ''A,,/ ; • ... (2.20) 

with b ~ 1 and where we have integrated down to a length 
scale r = e ll e (e is a UV cutoff). We can now compute 

d u \ = 2gN c d u g = 2bN 2 g 4 = 2bX 2 (2.21) 

which shows that A has a sensible RG flow in the large TV C 
limit. Third, we note that the quartic coupling of the fields g 2 
goes to zero more slowly as a function of TV than in the vector 
model. This is a clue that the matrix model is more strongly 
correlated than the vector model. 

A particularly special matrix model is provided by N = 4 
super Yang-Mills theory in 3 + 1 dimensions J3- This theory 
has gauge group SU(N C ) and contains many matter fields all 
in the adjoint representation. To be specific, the theory con- 
tains six real scalars and 4 Weyl fermions as well as the gauge 
field. The scalars and fermions carry a large symmetry, called 
an R-symmetry, equivalent to SU(4) ~ SO(6). This theory 
is very special because the matter content is just right so that 
the j3 function for A vanishes for all A. Thus this theory ac- 
tually describes a fixed line of conformal field theories. All 
this is possible thanks to the very large amount of supersym- 
metry (J\f = 4) present in the theory. The most important 
scaling operators are so-called single trace operators of the 
form tr(Oi...O„) where Oi are adjoint charged fields e.g. the 
scalar, fermions, or field strength. At weak coupling the di- 
mension of such an operator would be roughly n. Later on we 
will meet the holographic dual of this theory. 

There are lattice models preserving a large amount of su- 
persymmetry which are believed to flow to the TV = 4 the- 
ory in the infrared l30ll . Considerable fine-tuning is still re- 
quired even with the large amount of lattice supersymmetry, 
but in principle entanglement renormalization can be carried 
out even for the Af = 4 theory. 
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C. Entanglement and entanglement renormalization 

Having introduced conformal symmetry and the idea of a 
large N limit, we now review entanglement in many-body 
systems. Entanglement renormalization illuminates the struc- 
ture of entanglement in CFTs in a way that is commensurate 
with scale invariance and suggestive of holography. 

Entanglement is an important kind of non-classical correla- 
tion that appears in composite quantum systems like the one 
shown in Fig. Q] A composite system AB is said to be en- 
tangled if the the state of the whole system \i/>ab) fails to 
factorize over the subsystems \4>ab) 7^ I^a) ® IV'-b)- A con- 
venient way to measure entanglement is provided by entangle- 
ment entropy or more generally entanglement Renyi entropy. 
The Renyi entropy is defined as 



S n (A) 



1 



1 



Mtr(^)) 



(2.22) 



where p \ is the state of A obtain by tracing out B. If the state 
of AB is pure then we have S n (A) = S n (B) and S n (A) pre- 
cisely measures the degree of entanglement between A and 
B. The entanglement entropy S(A) is obtained in the limit 
n 1 and enjoys various special properties. A related con- 
struction is provided by the mutual information, defined for 
two regions A and B in a larger system ABC by 



I{A,B) = S(A) + S(B) - S{AB). 



(2.23) 



Since S(AB) = if AB is pure, the mutual information in 
that case measures entanglement, but more generally the mu- 
tual information measures the total amount of correlation be- 
tween A and B. For example, the mutual information bounds 
connected correlation functions l3lll of operators in A and B 



X{A, B) > 



(OaO! 



\\Oa\?\\O l 



(2.24) 



To understand the basic scaling structure of entanglement in 
many-body systems, consider the following scaling argument 
iH[32l. Let r denote the length scale of interest in the many- 
body system and let A be a region of linear size R. Locality 
suggests that the entanglement between A and B at scale r is 
proportional to the size of dA in units of r, namely 



dS oc ( — 

r 



d-l 



(2.25) 



dS is also proportional to the RG measure dr/r and hence we 
should have 



r I r 



(2.26) 



Integrating from the UV cutoff r = e to an IR cutoff r 
min (R, £e) we obtain 



r min(,R,£ E ) /d 

dS= ' 7 



d-l 



(2.27) 




FIG. 1: Basic setup of entanglement entropy calculations. The sys- 
tem is divided into two components, here called A and B. The 
smaller of the two is A which has linear size R. The entanglement 
entropy S(A) is typically proportional to \dA\ which in d — 2 di- 
mensions would be R. This scaling of entanglement with boundary 
size is called the area law. 



for d > 1 and 



S ~ In (min (i?, £e)/€) 



(2.28) 



in d = 1. £e is the length scale beyond which the state has no 
more entanglement, so we integrate to £e or R depending on 
whether we first run out of entanglement or run out of degrees 
of freedom in A. The observation that the entropy is typically 
proportional to the boundary of A in d > 1 is called the area 

law (see Fig. [DlH B- 

Entanglement renormalization makes the picture above pre- 
cise and gives a powerful computational tool for many-body 
systems [QlEll]- Entanglement renormalization characterizes 
the structure of quantum states in terms of two operations, the 
removal of local entanglement and coarse-graining of degrees 
of freedom. We imagine the quantum state of the system at 
a given length scale r. The degrees of freedom at length r 
are grouped into blocks to be coarse-grained. Local entangle- 
ment is dealt with using short-ranged unitary transformations. 
These act at the edges of blocks to remove short-range cor- 
relations between blocks while preserving long-range corre- 
lations (blue squares in Fig. |2). In practical terms, the goal 
may be to reduce the support of the reduced density matrix of 
a block as much as possible. Coarse-graining is then imple- 
mented by isometries that map the most important states of 
a block into a renormalized degree of freedom (red triangles 
in Fig. |2). Isometries are like unitaries except that they sat- 
isfy W^W = 1 but not WW'* = 1. When the whole system 
is scale invariant, one expects to be able to remove the same 
amount of entanglement at each scale and then coarse-grain 
into equivalent degrees of freedom. We call the whole net- 
work the entanglement renormalization network or simply the 
MERA network for short. 

The use of unitaries and isometries is dictated by the desire 
to compute correlations efficiently. Typically, one would com- 
pute an expectation value of some local operator O using two 
copies of the network for the bra and ket in (O) . Let us imag- 
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Spin chain 

— o — o — o — o — o — o — o — o — o — o — o — o — o — o — o — o- 



State that encodes "entanglement per scale" 



-o — o — o — o- 

I I I I I 



-o — o — o — o — o — o — o — o — o — o — o — o — 
I I I I I I I I I I I I I 



i I ! i I i I i I i i i I i I i i i 

o o o o o o o o 

I I II 

I I = disentangler - coarse graining O — = site + bond 

i i T 

FIG. 2: Schematic structure of entanglement renormalization with 
d = 1 and k = 2. The blue squares represent unitaries, called disen- 
tanglers, that remove local entanglement. The red triangles represent 
isometries, called coarse-grainers, that thin out unentangled degrees 
of freedom. 



U U = l 



FIG. 3: An operator (green box) is coarse-grained using one layer 
of the MERA. Most of the isometries and unitaries of the layer will 
give 1 in this process (substitution in grey shaded region) because of 
identities like U ] U = 1. 



ine a scheme where k d sites are to be coarse-grained at each 
step. The Hilbert space of a site is assumed to have dimension 
X throughout the network (this can obviously be relaxed). The 
state at the UV lattice is related to the state at one lower layer 
by a layer of disentanglers U and coarse-grainers W as in the 
schematic expression |r = e) = (<E>U)(<E)W)\r = ke). Be- 
cause unitaries and isometries satisfy U'U = 1 and W^W = 
1 we immediately have 

(r = e\r = e) 

= {r = fce|(®W t )((giO' t )((8)U')((g)W)|r = ke) 

= (r = ke\r = ke). (2.29) 

We can then repeat the argument for the next layer and thus 
reduce to calculation of the inner product to the few IR degrees 
of freedom (in a finite size system). Similarly, if we want to 
compute (O) then most of the unitaries and isometries will 
simply cancel to give one, with only a few becoming mixed 
up with the action of the operator. In this way, both inner 
products and local correlations are efficiently computable. 

Let us look at the case of correlations in more detail. Di- 
mensions of operators can be computed using the disentan- 
glers and isometries of the tensor network l35ll . We renormal- 
ize an operator by applying a layer of of the network to the 
operator as shown in Fig. [3] and while most of the tensors 
act trivially, a few will lead to a non-trivial transformation of 
the operators. Operators that satisfy C(0) = AO, where C(») 
is the coarse-graining transformation, are called scaling oper- 
ators because they merely change by a constant factor when 
renormalized. When using a k d —> 1 scheme where k d sites 
(in d spatial dimensions) are coarse-grained into one we have 
A = fc~ A where A is the dimension of the operator. 

Another important feature of the en tang lement renormal- 
ization is the causal cone or RG image 12211 . The causal cone 
of a region A in the UV is the set of all sites, unitaries, and 
isometries in the network that can influence the state of A. As 
described in Refs. lffl, l2ltl . the causal cone shrinks exponen- 
tially fast as we coarse-grain until it is of order a few lattice 



sites (the details depend on the precise scheme, see Fig. 0J- To 
compute the entropy of a region A, we then start deep down in 
the causal cone of A, when the width of the cone is only a few 
sites and reverse the RG circuit to reach a more finely-grained 
scale. Sites not in the causal cone at the new scale can be 
traced out. Since these sites occur along the boundary of the 
causal cone, the total amount of entropy added is bounded by 
the size of the boundary of the causal cone. Roughly speak- 
ing, the entropy is bounded by the number of bonds cut times 
the maximal entropy per site hi ( \ ). We argued extensively in 
Ref. U] that this bound could be interpreted in terms of mini- 
mal curves (or surfaces) within the discrete network geometry 
of the MERA. For example, Fig. [5] shows how the number of 
bonds that must be cut to isolate a region can be greatly re- 
duced by descending into the network instead of isolating the 
region by cutting it off at the UV scale. We also argued that 
correlation functions could similarly be understood in terms 
of such minimal curves. 

The resulting discrete geometry, really a graph with a cer- 
tain connectivity, approximated smooth AdS space in the case 
when the lattice model was at a critical point. This notion 
of approximation can be made precise in a number of ways, 
including comparing lengths of corresponding curves in the 
graph and the smooth space and by comparing the spectrum 
of the graph and continuous Laplacian [l]]. Finally, we gener- 
alized entanglement renormalization to finite temperature and 
showed how black hole-like objects appeared in the construc- 
tion. Briefly, we considered a circuit Urq(T) whereby the 
thermal state p(T) was represented as 

p(T) = U RG (T) Po Ul G (T). (2.30) 

Although no longer purely described in terms of entangle- 
ment, the disentanglers can still be understood as removing 
short-range classical and quantum correlation. A black hole 
is represented by a situation where all correlation is removed 
and each site of the system sits in a maximally mixed state. 
Such a maximally mixed state corresponds to infinite temper- 
ature or zero Hamiltonian which is consistent with fact that 
the black hole horizon is a null surface where the local tem- 
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FIG. 4: The causal cone is the set of all unitaries, isometries, and sites 
of the network that can affect the state of a region in the UV lattice. 
The effective number of sites in a region, and hence the width of the 
causal cone, shrinks exponentially fast we descend in the network. 
This behavior persists until the causal cone width is of order a few 
lattice spacings where it will fluctuate depending on the details of the 
scheme. 



space 




FIG. 5: Entropy bounds in entanglement renormalization 
(schematic). The entropy of a region in the UV lattice (grey boxed in 
region) is bounded by the number of network bonds that must be cut 
to isolate it. The red curve is the corresponding minimal curve which 
is pierced by the minimal number of bonds. The length of this curve, 
suitably defined, or equivalently, the number of bonds cut bounds the 
entropy. 

perature diverges. 

D. Quantum expanders 

We have argued that entanglement renormalization natu- 
rally identifies area with entanglement. Hence to have a classi- 
cal area in the holographic interpretation of MERA we should 
find an object that adds a definite amount of entanglement. 
Enter quantum expanders. The definition of a quantum ex- 
pander begins with the notion of a quantum channel £. We 
imagine we have in our control a system A with some state 
PA- This system A is also in contact with an environment B 
with which it is initially uncorrected. The full state is thus 
Pab = PA® |0)(0|s where |0) is some reference state. We 
are allowed to act with a unitary transformation Vab on AB 
followed by discarding B. A quantum channel in its simplest 
form is then the sequence of steps 

PA -* PAB VabPAbV A B 

-+ \i b {VabPabVI b ) = £{ PA ). (2.31) 

This process is designed to model noise acting on quantum 
information in the form of a unitary transformation acting on 
a larger system (to which we do not have access) containing 
the system of interest. 



As a simple example, suppose A and B are two level sys- 
tems with states |0) and |1) and with V chosen such that 

V AB \00) = |00) (2.32) 

and 

Vab\W) = 111)- (2.33) 
If p A = W)h\>\ with |V) = c |0) + ci|l) then 

£(^) = |c | 2 |0)(0| + | Cl | 2 |l)(l| (2-34) 

and the state has gone from pure to mixed. We may interpret 
this particular quantum channel as decohering or measuring A 
in the {|0), |1)} basis since it copies the state of the system in 
this basis to the environment B. Note also that pa was pure 
while £(pa) has entropy of In 2, so we learn that quantum 
channels can add entropy (noise) to the system. 

It is a standard theorem (Stinespring dilation theorem) that 
any quantum channel can be written as 

£{p) = £ M aP Ml (2.35) 

a=l 

where the M a are called Kraus operators and satisfy 

MtM a = 1 (2.36) 

a 

(proven by tracing over A and using the fact that V is uni- 
tary). Indeed, the operators M a are nothing but particular 
sub-matrices of V, 

(M a ) i:j = {V A B)i a ,jo, (2.37) 

so that 

\a=l / u 

= y~](VAB)ia,jOPjk (V AB )fcO,;c« 
a 

= Y.iMVABip® |0)<0|)Fj fl |Za>. (2.38) 

a 

Note that all we really require to prove this theorem is that the 
initial state of AB be uncorrected. 

A quantum expander is simply a quantum channel where 
the matrices M a satisfy 

M a = —^U a (2.39) 

with U a unitary and where the expander is guaranteed to add 
entropy to the state 1261 l27ll . The latter condition is obtained 
by requiring the spectrum of the expander to have a gap. 
Eigenoperators of £ are operators such that 

£{0) = AO (2.40) 
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and the set of all A is the spectrum of £. It immediately fol- 
lows that the operator 1 is an eigenoperator with eigenvalue 
Ai = 1. The spectrum is called gapped, with gap 5, if the 
next largest eigenvalue satisfies | A2 1 < 1 — S. It is conve- 
nient to choose the quantum expander with a max even and 
U a +a ma /2 = Ul for a = 1, ...,a max /2. This insures that 
the quantum expander is formally a Hermitian operator. 

To see what this gap condition has to do with adding en- 
tropy, consider the inner product on the space of \ x X matri- 
ces given by 

(M,N) =ti{M^N). (2.41) 

The normalized operator 1 fs/x i s an eigenoperator of £ with 
eigenvalue 1. The gap then implies that all other operators 
are rapidly shrunk the by quantum expander. Suppose we be- 
gan with a pure state po = l 1 /')(V'l ar, d applied £ n times to 
generate a sequence of states {pi} satisfying 

Pn = £{p n -i). (2.42) 
Since po is normalized we know that its overlap with 1 / sjx is 

(p ,l/Vx)=X~ 1/2 (2-43) 

and hence 

Pn ~ - + (1 - 5) n 2 + ... (2.44) 
X 

This implies that p n approaches the uniformly mixed state ex- 
ponentially fast and hence we should expect the entropy to 
increase linearly with n. Of course, how much entropy is ac- 
tually added depends on the whole structure of the quantum 
expander. 

For another perspective, consider the state pi given by 

Pi=£ — UcMWWi- (2-45) 

a 

To the extent that the states U a \ip) are orthogonal for differ- 
ent a, this expression has In (a max ) more entropy that po = 
(which had none at all). In the same way, so long as 
the states U a U a >\ip) are orthogonal for a, a' not equal, the 
state P2 has still more entropy, 2 In (a max ) to be precise. Thus 
we see again the linear increase in entropy. 

Although we will not directly use the above definition of 
a quantum expander, these objects are similar to the coarse- 
graining transformations we consider below. The main dif- 
ference will be that while most of the eigenvalues A will be 
very small, a sparse number of larger eigenvalues will per- 
sist even when a max and \ are verv large. Nevertheless, these 
sparse large eigenvalues are not enough to prevent the coarse- 
graining transformation from removing (or adding) a large and 
definite amount of entropy from each application. 

Typically we will take \ ~ e N with TV our rough measure 
of the total number of local degrees of freedom. It was shown 
in Ref. l36ll that in the context of quantum expanders where 
the unitaries U a are drawn randomly from the Haar measure 
then the second largest eigenvalue is roughly 2/^/a max which 



is very small if a max ~ \- As an aside, in the quantum infor- 
mation context one typically takes a max to be relatively small 
(it is analogous to the valence of nodes in a classical expander 
graph), but our needs are cruder than the full blown small a max 
quantum expander. 

While generating Haar random unitaries of size e N x e N is 
exponentially hard in TV i.e. we must apply an exponentially 
large in TV number of elementary unitaries, our physical intu- 
ition in the area of thermalization tells us that it is not so hard 
to produce very mixed looking density matrices. Formally, 
we can invoke a unitary t-design J37ll which is simply a mea- 
sure on the unitary group that agrees with the Haar measure 
for all polynomials in U of degree less than t. Since these t- 
designs can be efficiently realized it follows that using generic 
i.e. Haar random unitaries is not unphysical provided we don't 
really need arbitrary powers of the unitary. A recent power- 
ful result is Ref. [3_8j which shows that circuits composed of 
small random unitaries are approximate polynomial designs. 

An extreme form of thermalization known as scrambling 
has been ascribed to black hole dynamics and large TV matrix 
models, so our considerations are not totally foreign to holog- 
raphy I39I1 , Scrambling is an extreme form of thermalization 
where the system is globally thermalized, often at infinite tem- 
perature, and looks random even to non-local probes (see Ref. 
[? ] for a discussion). This can be contrasted with local ther- 
malization, hydrodynamic relaxation, and the equilibration of 
local observables (all of which must happen before full scram- 
bling can occur). 



III. LARGE TV ENTANGLEMENT RENORMALIZATION 

The generic system we consider has sites supporting a lo- 
cal Hilbert space dimension of roughly \ ~ eN ■ Sites are 
arranged into a regular graph in d spatial dimensions. Let 
be the ground state of a local Hamiltonian H acting on 
this graph. Throughout this section we consider entanglement 
renormalization for | fi) . We assume H is translation invariant, 
THT~ X = H, where T is translation by one unit cell. We 
also assume that the system is scale invariant e.g. at a critical 
point. Later we will relax the second assumption and consider 
more general kinds of Hamiltonians and states. The most in- 
teresting physics emerges when we consider strongly coupled 
systems where all TV degrees of freedom interact strongly. We 
sometimes refer to |fi) as the UV state \UV) and the original 
lattice as the UV lattice, k denotes the RG scheme i.e. we take 
k d sites into 1 site after coarse-graining. Note that the MERA 
network is not translation invariant by constructions, so trans- 
lation invariance is a non-trivial condition on the tensors. 

To access the structure of strongly interacting large TV the- 
ories we assume that the tensors in the network are generic 
up to constraints imposed by symmetries. This single as- 
sumption underlies our considerations of operator dimensions 
and area/entanglement relations. The large TV parameter is 
needed to give an expansion parameter, while the assump- 
tion of strong coupling is necessary to assure that the system 
doesn't decompose into many weakly coupled pieces. Only a 
strongly coupled large TV system will have disentanglers and 
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coarse-grainers that are truly quasi-random or scrambled. 

A. Operator dimensions 

Let us dispense with the trivial case first. Suppose for a 
moment we did have N non-interacting copies of a local sys- 
tem on each site. In this case the disentanglers and coarse- 
graining transformations would factorize e.g. U — U® N for 
a given disentangler. We see immediately that the coarse- 
graining transformation C will have many eigenvalues of var- 
ious sizes all throughout its spectrum. Indeed, if O is a scal- 
ing operator in the single copy theory with dimension A, then 
we can trivially form composite operators ®iOi with dimen- 
sion Y^i &4 and hence the spectrum of operator dimensions is 
quite dense. This situation actually describes the large N vec- 
tor model ll25ll . and the weakly coupled matrix model before 
we project onto gauge invariant states. 

To obtain a sparse spectrum of scaling dimensions we 
must consider more strongly interacting systems. The coarse- 
graining operation C acting on an operator O is 

C{0) = V Ji OV (3.1) 

where the operator V is a shallow depth quantum circuit, the 
product of the all the unitaries U and isometries W in a single 
layer of the MERA. This operator is itself an isometry and we 
note that if the operator O is local then many of the unitaries 
and isometries in V simply cancel to give factors of 1. We 
model strongly interacting systems by assuming the unitaries 
and isometries strongly mix the Hibert space on which they 
act. The similarity to a quantum channel, and in particular 
quantum expanders, should hopefully be clear. Now we can 
ask a sharp question, namely what is the structure of matrix 
elements of large generic isometries? 

Within MERA and more generally, isometries can be un- 
derstood as unitaries where some of the matrix indices are set 
to fixed values. For example, consider a unitary transforma- 
tion U : Ti 2 — > Ti 2 (dim(H) = \) and define the isometry 
W : H -> U 2 by W\ip) = U\ip)\0) where |0) is a reference 
state. In this sense an isometry is simply a unitary restricted 
in its input or output. Indeed, we can verify that W^W = In 
while WW^ = P where P projects onto the image of W. 
Now if the unitary U is generic, then its typical matrix ele- 
ments are of order 1/x- Furthermore, the action U'(»)U will 
typically turn all operators of the form (O ® 1) into the iden- 
tity. We consider operators of the form O ® 1 because they 
mimic the fact that local operators act on a small subset of the 
sites. 

Here is a formal argument. Since U is generic, we may as 
well average over all U, thus consider 

J dU H aarUHO®l)U. (3.2) 

U is a x 2 x X 2 unitary so we have 

f dUUuiU^KL = \Sjk5il- (3.3) 



Each of IJKL is a composite index running over x 2 values, 
so we can write 

\ Y, 6 JkSil(0 ® l)j K = t ^5iL- (3.4) 

X JK X 

Here the trace is not the matrix trace above, but the trace over 
the full Hilbert space. If O is traceless then this is simply zero, 
but more generally all operators are immediately reduced to 
their trace. Clearly this represents an extreme limit. 

To estimate corrections to this picture, consider the operator 
O = \n)(n\. We find that 

(n0\U\O ® l)U\nO) 

= Yi^P^nfyinkplnO) 

k 

= Y\{nO\U^\nk)\ 2 

k 



The last line follows because we sum up x numbers of rough 
size 1/x 2 - Hence the operator O is reduced by a factor of 
1/x by the action U'(»)U and other operators are generated. 
Most diagonal elements are now also of order 1/x while off- 
diagonal elements are of order 1 /x 3 ^ 2 since phases can cancel 
in this case e.g. adding x random phases gives a number of 
size y/x- Similarly, differences in diagonal elements are of 
order 1/x 3 / 2 . Thus we can say that corrections are of order 
1 /x p for some small number p. This agrees with the perspec- 
tive from quantum expanders, although again, we are not pre- 
cisely considering quantum expanders. In that setting, if U is 
generic then we would have roughly In (a max ) ~ In (x) and 
hence the expander eigenvalues are of order l/x p according 
toRef. M- 

Repeating this discussion for the actual isometries compos- 
ing C for which x ~ e A \ it follows that the coarse-coarse 
graining transformation, when acting on local operators, will 
almost certainly reduce the operator in size by at least a fac- 
tor of e . Since the eigenvalues of the coarse graining op- 
eration are A = fc~ A , our estimate shows that most scaling 
dimensions are very large. Thus we have two important re- 
sults. First, most scaling dimensions are large in a strongly 
coupled large N theory, and second, at fixed coupling these 
dimensions increase with the number of degrees of freedom. 

In some special cases we can interpolate between the limit 
of completely random tensors and tensors that decompose into 
many smaller pieces. For example, in the Af = 4 theory the 
coupling can be varied arbitrarily while preserving conformal 
invariance. Thus at fixed N we can smoothly go from the 
weakly mixed case to the strongly interacting case with an 
accompanying charge in the spectrum of scaling dimensions. 
The fact that a factorized (weakly coupled) large N disentan- 
gler can still add or remove a lot of entanglement indicates 
that the amount of entropy added is partially independent of 
the spectrum. Thus in principle there can be multiple scales 
in the spectrum, set by parameters like N and by the cou- 
pling. In other words, the spectrum is not completely fixed 
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by specifying the entropy added. We would need a detailed 
model to determine all the scales, but we emphasize that our 
basic argument is general: a large A" strongly interacting the- 
ory will have quasi-random tensors, and such quasi-random 
tensors lead to a sparse spectrum of dimensions. 

If the disentanglers and coarse-grainers in the MERA were 
truly random then the above result would be the whole story, 
but this is physically incorrect. Indeed, a state with random 
tensors is not translation invariant. Furthermore, there must 
at least be a stress tensor in the theory which transforms as a 
scaling operator of low dimension. This is required by confor- 
mal invariance. Let h be the energy density scaling operator 
with dimension A^. In a theory in d dimensions with dynami- 
cal exponent z we have A;, = d+ z and thus h is a low dimen- 
sion operator unless z gets large. If the MERA represents the 
ground state of the large A" model with energy density h then 
h must be among the eigenoperators of the coarse-graining 
transformation. Similarly, if the theory has a bosonic symme- 
try, the conserved current of that symmetry will also have a 
small dimension. Supersymmetry can protect the dimensions 
of certain operators insuring that they remain low dimension 
operators. Importantly, while all these symmetries impose 
constraints on the structure of the disentanglers and isome- 
tries and hence lead to low dimension operators, the generic 
statement remains that most local operators have a very large 
dimension under the coarse graining transformation. 

Besides these primary operators we also have descendent 
operators, essentially derivatives of primary operators. Trans- 
lation invariance of the MERA guarantees that these operators 
are also in the spectrum of the coarse-graining superoperator. 
Indeed, take a scaling operator O and consider 

Q0 ^ O(x) - O(0) = T*O(0)T-* - O(0) 

x x 

where again T is the translation operator. The translation in- 
variance of the MERA means that TOT -1 has the same di- 
mension as O and hence 

k~ A 0(x/k) - fc- A O(0) 



C{dO) 



= hr 



_ x O(x/k)-Q(ti) 
x/k 



fc _A - 1 aO. (3.7) 



The last equality implies that dO has dimension A + 1. 

Finally, let us address the question of product operators. 
Suppose Oi are two low dimension operators, then what can 
we say about the dimension of O1O2? The operator product 
expansion in Eq. 12.151 is clearly relevant here. In general, 
a product of two scaling operators will generate many other 
scaling operators. We know, for example, that if tr{0\02) 7^ 
then the identity operator will be among the operators gener- 
ated. This follows because the action of C on an operator can 
be expressed as C(») = J2 n ^nO n {O n ,*) (assuming C is her- 
mitian) where the O n are normalized eigenoperators. Large A~ 
factorization of correlation functions takes many forms, but 
here let us simply note that in the context of holography it 
implies that certain so-called double trace operators, which 
are essentially squares of single trace operators, have a di- 
mension A d = 2A S + 0(N- p ) JUIH]. In our context this 




FIG. 6: Schematic of a k = 3 MERA in d = 1. A site on the lower 
layer has entropy at most N and the red isometry preserves this when 
moving from the lower to the upper layer. The blue disentanglers add 
roughly 2N entropy to the entropy of the three site block which is 
consistent with an entropy bound for the block of 3N. 



follows from the following argument. Low dimension opera- 
tors, by definition, largely decouple from the mixing dynam- 
ics generated by the coarse-graining transformation. Thus we 
expect that these operators are effectively weakly interacting 
and hence their dimensions should approximately add. High 
dimension operators, which are strongly coupled to the mix- 
ing dynamics, are not expected to satsify the same property. 



B. Entropy and area 

Now we turn to the question of the emergence of area. We 
reviewed the arguments of Ref. jj] that the MERA naturally 
encodes an emergent holographic geometry in which minimal 
surfaces provide bounds on entanglement. Does this bound 
become sharp in the large A" limit? It has already been ob- 
served numerically that a single layer of the MERA in d = 1 
tends of add the same amount of entropy at each step of the 
RG. This is less surprising when we realize that the k d — > 1 
coarse graining transformation implements a lattice version of 



the unitary operator V(D, k) 
generator) so that 



-iln(fc)X) 



(D is the dilatation 



C(.) ~ V^(D, k)(»)V(D, k). 



(3.8) 



Remarkably D does indeed have a gap, Aq, in its spectrum, 
so the spectral values A in 



AO = V\D, k)OV(D, k) 



(3.9) 



satisfy Ai = 1 (Oi = 1) and A 2 = fc~ A ° < 1. Thus the con- 
tinuum operator V(D, k) looks similar to a quantum expander 
even at finite N . This argument helps explain why the MERA 
was observed numerically to contribute the same amount of 
entanglement at every scale. 

There is an even stronger statement at large N. If the disen- 
tanglers of the MERA are generic in the above sense, then it 
follows that they will add or remove a definite amount of en- 
tanglement when acting on most states. Let us understand the 
analogous process for a quantum expander. Consider again 
the transformation of density matrices given by 



-, a max 

£ = J2 u <*p u I 



(3.10) 
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As we showed above, this transformation arises as unitary 
transformation on a larger space and the number a. max is a 
measure of the entangling power of that unitary. We know that 
if the U a are random then this transformation is a quantum 
expander that adds entropy of ]n(a max ) to all but the most 
mixed states ll36ll . If the a max ~ e N then this transformation 
will turn every density immediately into the completely mixed 
state. However, this is not quite what we want since we're not 
adding a definite amount of entropy (we're adding just enough 
to reach the maximally mixed state). 

We want to make sure the quantum expander adds a large 
amount of entropy, of order N, but also that it doesn't com- 
pletely saturate the entropy. In other words, with In (a max ) = 
fN the quantum expander will add a definite amount of en- 
tropy to states whose entropy is not already too high (S < 
(1 — f)N). Of course, the unitaries we are considering in 
entanglement renormalization do not precisely lead to quan- 
tum expanders (for example, the inputs may already be corre- 
lated from deeper within the network), but it is still true that a 
generic unitary acting at the boundary will add an entropy of 
order N unless the input state is extremely highly entangled 
already. Thus we see one of our main results, that generic 
tensors in the MERA will add definite entropy and have a 
spectrum of mostly high dimension operators. Although we 
again emphasize that the tensors are not totally random since 
we must have some low dimension operators. 

Adding a definite amount of entropy is good, but we also 
want to see that the tensors correspond to a large local geome- 
try. To motivate this, consider that the eigenvalue of a scaling 
operator is a measure of how easily the operator propagates 
through the network. Now without modifying the actual net- 
work, let us imagine grouping sites in a given layer into su- 
persites of size k. To renormalize this supersite into a single 
site somewhere deeper down requires a larger piece of the net- 
work, to be precise we now need log fe (fc) RG steps. Using this 
"coarse-grained" coarse-graining operator, we see that scaling 
operators now effectively get reduced by a bigger amount. But 
this has a simple interpretation since the operator has really 
just moved further in the network (multiple layers). Thus the 
appearance of small dimensions is naturally interpreted as the 
existence of an effective large distance in the network. 

Even at large N and strong coupling some of the operator 
dimensions are small, so we know that we haven't just triv- 
ially "coarse-grained" the coarse-graining transformation, but 
on the other hand, many operator dimensions are large, so it is 
as if these operators are moving over a much greater distance 
than the low dimension operators. This suggests the existence 
of two lengths scales, one associated with low dimension op- 
erators and one with high dimension operators. Furthermore, 
if we measure distances in terms of the length scale of the high 
dimension operators then a single step of the coarse-graining 
transformation moves operators a large distance. 

Another potential concern, noted above, is the appearance 
of (nearly) maximally mixed state, to which adding entropy 
is (nearly) impossible. To address this concern, consider the 
situation in d = 1 with k = 3. Each site in the network has 
an entropy of at most N, and when we reverse an isometry to 




FIG. 7: Part of the RG image of a block in the UV lattice (green 
block). The RG image is the grey shaded region with the green bor- 
der. It continues down into the bulk of the network. 



reach a less coarse-grained level, 

p^W P W\ (3.11) 

the entropy of the resulting state is identical, see Fig. [6] How- 
ever, the resulting state resides in a large Hilbert space of di- 
mension e 3N and hence has maximum entropy of 37V. When 
we now act with the disentanglers on edges of the new three 
site state, we will generate a large amount of entropy since the 
state of the three sites is far from maximally mixed e.g. it has 
only 1/fc-th of its maximal entropy (with k = 3). Each dis- 
entangler will increase the entropy by TV still well in line with 
the total entropy capacity of the region. More generally, for a 
k d —> 1 scheme in d dimensions, the entropy of a block before 
the disentanglers act no more than N, but since the maximum 
entropy of the region is k d N, the 2dk d ~ 1 disentanglers acting 
on the boundaries of the block (based on assuming a hyper- 
cubic block of sides k) can still easily add roughly N to the 
entropy. Indeed, we only require k > 2d or otherwise only 
add j^N entropy per disentangler to be consistent with the 
bound. 

Let SSu denote the entropy added by the disentangler U 
to all but the most mixed states. The total entropy added to 
a region is then simply 5Su times the minimum number of 
bonds that must be cut to isolate the region. Thus in the dis- 
crete geometry of the MERA network, the entropy of a region 
is, up to a factor of SSjj ~ N, simply the minimal number 
of links piercing the RG image of the region. The RG image 
of a region is the causal cone we discussed above, namely all 
the sites and tensors that can influence the state of the field 
theory region. Part of the RG image of region in the UV lat- 
tice (green block) is shown in Fig. [7] Thus to leading order in 
N the entropy of a region is completely fixed by the minimal 
curve needed to isolate it; corrections only occur at order TV . 

The geometry of the network is still coarse because of the 
lattice structure. However, we argued that a smooth geom- 
etry emerges from a combination of the coarse geometry of 
the tensor network combined with the large tensors within the 
network. In other words, each disentangler, say, of the tensor 
network is really a chunk of a smooth geometry. One might 
be worried that these chunks are nevertheless combined in a 
very non-smooth fashion, but actually this is an illusion. In a 
strongly coupled system, the degrees of freedom on any one 
site are strongly mixed with those of neighboring sites so that 
the distinction between different sites really loses distinction. 
A sharp version of this statement is provided by translation 
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FIG. 8: RG image or causal cone of two nearby regions. Notice how 
the regions merge under coarse-graining before they shrink to the 
lattice scale. 



invariance. Although different sites are formally treated dif- 
ferently by the MERA, translation invariance guarantees that 
we will get the same result by shifting the entire network by 
one unit cell. The resulting network structure looks quite dif- 
ferent but still describes the same state. Thus the sites of the 
network are really losing meaning deep into the bulk. It is also 
interesting to observe that in the context of various topological 
theories, the bulk network structure of the MERA is explicitly 
deformable in that new degrees can be exactly introduced into 
anywhere in the network 114011 . The topology and connectivity 
of the network is thus flexible, and the smooth geometry of 
the bulk is all the more reasonable in this case. Our freedom 
to perform the RG in different ways may correspond to bulk 
diffeomorphisms . 



X = 2SSu lo 



I? 2 



(2R + x)x 



(3.14) 



which reproduces the results of Ref. H32H including the holo- 
graphic computation. 

To give a higher dimensional example consider the case of 
d = 2 with A and B two long blocks of length R, width W, 
and separated by a distance x (see Fig. [12] for an example 
of one such block). The number of bonds cut by the merged 
region is 

log fe (x/e) log, ((2W+x)/e) 

V A + V ^ (3.15) 



n=0 n=0 

while the number of bonds cut by each region separately is 

log fc (W/e) 



ER 

n=0 



(3.16) 



Hence the mutual information is 

log fc (Wye) 

* E 

n=0 



SSu ^ k n e 



log k (x/e) log fc ((2W+x)/e) 

V V (3.17) 



C. Mutual information 

We have now discussed the relation between the form of 
the spectrum of scaling dimensions and the emergent geomet- 
ric structure of entanglement. The structure of the mutual in- 
formation follows as a natural corollary from the structure of 
entanglement and correlations. Consider two regions A and B 
in the field theory of linear size R separated by a distance x. 
We ask what happens to these regions when we coarse-grain 
them using entanglement renormalization. There are two lim- 
its to consider depending on whether x ^> R or x <C R. 

If x <C R then A and B will quickly be merged by the RG 
flow after a number of steps given by log/, (x/e) as shown in 
Fig. [8] The number log, (x/e) is simply how many RG steps 
to reduce i to f and where x shrinks by 1/fc at every step. 
This kind of number, which is roughly the RG time needed 
to shrink a region to the lattice scale, will appear again and 
again below. In one dimension the number of bonds cut by 
the resulting composite region is then 



21og fc ((2i? + a ;)/e) + 21og fc (x/e) 



(3.12) 



where as each region by itself would cut 2 log fc (R/e) bonds. 
The mutual information is thus 

4 log, (R/e) 



SSu 



-2 log fc ((2R + x)/e) + 2 log, (x/e) (3.13) 



or using 



JV 

71=0 



-N-l 



1 - k- 



(3.18) 



we have 



1 = 



SSu R 



1 - k- 
SSu R 
c-1? 



2-2 l -± 
kW 



k x 



1 

1 + r 



k 2W + x 
(3.19) 



This is again what was found in Ref. 13211 on scaling grounds 
and in holography. Note also the disappearance of the UV 
cutoff as in d = 1. 

What about the opposite limit where x ^ Rl In this case 
both regions are renormalized to the cutoff scale, taking about 
log fc (R/e) RG steps, long before they merge. Thus all cor- 
relations between A and B are due to point like operators 
originating at the tips of the minimal surfaces (see Fig. [9]). 
However, we have already seen that the spectrum of operator 
dimensions is such that only a few low dimension operators 
exists. All other operators, even if present, contribute only 
very tiny corrections, of the order of (R/x) Al " ec . Thus the 
mutual information rapidly crosses over from a large order N 
piece (recall that SSu ~ -^0 to a order one decaying power 
law at large distances mediated by the few low dimension op- 
erators. Note that this N dependence follows from our result 
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FIG. 9: Mutual information in the limit x ^> R. The two regions 
rapidly shrink to the lattice scale and are then only coupled by a few 
low dimension operators (dotted green line). 



for the large N entropy above since the minimal surfaces for 
two distant regions will simply be the minimal surfaces for 
each region separately. Thus all the N dependence should 
cancel. 

Since the density matrix pab is nearly factorized in the 
large distance limit, we can compute the mutual information 
in perturbation theory. The density matrix is assumed to be 
Pab = PaPb + $P with ^ab{^p) = 0. We also define 
5 pa = u~b(Sp) and similarly for B. The mutual information 
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FIG. 10: Trajectories of renormalized operators in the network. 
Panel A shows the three-point function while panels B and C show 
the four-point function in two different limits. Note the similarity to 
Witten diagrams Jnll . 



I = -tiA((pA + Spa) In (pa + 5p A )) 
-tr B ((p B + 6 p B ) In (p B + 5p B )) 

+tV A B{{pAPB + 6p) In (pAPB + Sp)). (3.20) 
Expanding to first order in Sp we find 

I = -Ka(PA In (pa) + Sp A In (pa) + PaP~a^Pa) 
-Kb(pb In (pb) + 5p B In (p B ) + pbPb^&Pb) 

+ttAB(pAPB In (paPb) + Spin (paPb) + Sp), (3.21) 

but this expression actually vanishes. Thus we have 1 ~ Sp 2 
and since 



(O a O b ) =u- A b(OaO b 6p) 
for Oa.b scaling fields, we expect 

(OaOb) 2 



WOaUOe 



(3.22) 



(3.23) 



for the correlator of the lowest dimensions operators Oa,b- 
Of course, this is consistent with the bound in Ref. lf3lll . Or- 
der 5p terms can appear when considering the Renyi mutual 
information i4lll . 

Finally, let us compare these strongly coupled results to the 
situation in a weakly coupled theory. For the decoupled vector 
model, the mutual information also decays at large distances 
like a power law, but the coefficient is not order one but order 
N. However, the vector model is a non-generic example be- 
cause the number of primaries with small dimension diverges 
as N —> oo. If the model is weakly coupled but does not have 
this property, then the spectrum of operator dimensions still 
won't be sparse, but also won't be diverging. The picture in 



Fig. |9]is then still valid and although the entanglement geom- 
etry is not local at small scales, it still follows that the mutual 
information has an order one (instead of order N) decay at 
long distances. 



D. Correlations, perturbations, and the causal cone 

We now turn to perturbations of the state, for example, in 
the context of computing correlation functions or studying im- 
purity problems. An immediate consequence of the causal 
cone structure of the MERA is that most tensors are not af- 
fected when considering a localized perturbation. For exam- 
ple, local correlation functions may be computed by mov- 
ing scaling operators through the network using the coarse- 
graining operator, but the only sites affected are those in the 
causal cone of the sites where the operators are supported. 
For two scaling operators separated by a distance x we must 
renormalize log fe (x/e) times to bring the operators adjacent 
to each other in the network. Since each operator acquires 
a factor of fc~ A after each RG step, the resulting correlation 
function goes like 



(fc— ) 



2Axlog fe (x/e) 



2A 



(3.24) 



as expected. This is strongly reminiscent of the geodesic ap- 
proximation in holography, yet the computation here is exact. 
A curious feature of the large N limit is that the scaling op- 
erators have, in a sense, a lot of room to move within a given 
site. They typically represent simple operators, products of a 
few fields, and hence are not very efficient at moving around 
within the local states. 

As shown in Ref. |1], a similar calculation is possible 
for the three point with the result being a slight generaliza- 
tion of the conformal result (which is appropriate since the 
MERA can represent scale invariant theories that may not 
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be conformally invariant). For example, consider three op- 
erators 0\, O2, and 3 at x\, x%, and x 3 and suppose that 
[aji — a~2 1 "C [£C2 — 1 ■ Then operators 0\ and O2 are first 
merged by the coarse-graining transformation into some op- 
erator 0\i and then this operator and operator O3 are merged 
much deeper in the network. We have shown the schematic 
trajectories of the operators through the network in Fig. [TOl 
Conformal invariance implies that A12 = A3, since otherwise 
(O12O3) vanishes (more generally, the smallest dimension 
operator with non-vanishing correlator with 3 contributes). 
The coarse-graining transformation then gives the following 
factors, with \x\ — x<z\ = £< and \x\ — Xs\ ~ \x% — Xs\ = £>, 

If A3 = A12, these factors reduce to 

/ ^Ai+A 2 -A 3 / \ 2A 3 

(0x0,0b) ~(^J y • 0.26) 

which is the usual CFT result 

(O1O2O3) ~ ^ Al +A 2 -A 3 ^A 2 +A 3 -Ai ^A!+A 3 -A 2 

up to factors of e. Like the two-point, a geodesic-like structure 
is evident here as well. 

Let us consider then the case of a four-point function which 
is not completely fixed by conformal invariance. Let the oper- 
ators sit at X\, Xi, £3, and x 4 with \x\ — x 2 \ — \x 3 — x 3 \ = l<, 
\x2 — x-s\ — l > , and f< < 4- The operator trajectories are 
again shown in Fig. [10] Operators 0\ and O2 and operators 
O3 and O4 are quickly merged together by the network, but 
now a new feature appears, since the merged operators O12 
and O34 can take a variety of values and still contribute to the 
four-point function. For example, we conformal invariance 
merely forces A12 = A34. Hence the four-point in this limit 
is given by 

, N A 1 + A 2 , X A 3 +A 4 , f x2A Q 

?(£) {£) it) 

where the sum over a is over all operators and the OPE coef- 
ficients cfj the amplitudes of various fused operators to form 
new primary fields (see Eq. 12. 15b . This sum depends on 
the full operator content of the CFT and is how effectively 
how the four-point function becomes non-trivial once we re- 
lax the condition of £> ^> £ < . We can of course also consider 
other kinds of limits, for example, if \x2 ~ x%\ — £ < and 
|^i—^2| = I ^3 — 1 = i> then we would obtain a similar 
sum with a factor of C2 3 c" 4 . Comparing these forms in dif- 
ferent limits leads to what is known as crossing symmetry in 
CFT; it is a non-trivial constraint on the dimensions and OPE 
coefficients on the theory. 

A general theme of all these computations is the simplifi- 
cations that occur due to the causal cone structure. Most of 
the data in the tensor network is not necessary to compute 
correlation functions of a few fields or even the entropy of a 



region. Thus we have a notion of RG causality. This kind 
of behavior, where, say, an impurity only affects the network 
locally, has been observed in the context of boundary critical 
phenomena in the MERA [42]. A similar sort of locality has 
been observed in the context of holographic duals of bound- 
ary CFTs i43ll . These considerations are also very reminiscent 
of the ideas in Refs. iH, l45tl where an attempt was made to 
identify bulk regions in a holographic dual that completely en- 
coded the observables of a given field theory region. It would 
be very interesting to carry these comparisons further, and to 
perhaps further study the locality of the bulk RG flow in differ- 
ent contexts e.g. holographic renormalization and Wilsonian 
renormalization ll46ll . 



E. Other geometries 

Having discussed extensively the structure of scale invari- 
ant situations, we can obviously consider other kinds of ge- 
ometries. The simplest kind of confining geometry is simply 
one in which the disentanglers successfully remove all entan- 
glement after a certain length scale. As discussed in Ref. 01, 
the holographic tensor network geometry should be under- 
stood as being smoothly capped off in this case. Correlation 
functions are identically zero beyond a certain length scale, a 
hard version of the expected exponential decay. 

A more subtle possibility is a situation in which most of the 
large N degrees of freedom are confined, but a few degrees 
of freedom remain at low energy. For example, in the context 
of the gauged matrix model, the gauge theory could confine 
at low energies thus removing the large N number of degrees 
of freedom. However, a few low lying modes could remain 
because of symmetry breaking or some other phenomena. In 
this case our large N considerations only apply above the con- 
finement scale while below it we recover a MERA description 
with a small number of local degrees of freedom. This phase 
of the large N theory could be described as "non-geometric" 
although we still have the discrete geometric structure of the 
tensor network even at small N. 

We can also imagine the effective dimensionality of space 
changing. For example, in Ref. ll47ll it was shown how by 
turning on a magnetic field one could start with a homoge- 
neous 3 + 1 dimensional system and then flow at low energies 
to a system of many decoupled 1 + 1 dimensional systems 
with a logarithmic violation of the area law. The holographic 
version of this story was worked out earlier in the form of a so- 
lution that interpolates between AdSs and AdS3 x R 2 . We can 
easily implement the same kind of RG flow in the MERA. As 
we descend in the network the disentanglers between different 
one dimensional chains will smoothly go to one as the chains 
decouple. The chains will eventually totally decouple if all en- 
tanglement between degrees of freedom can be removed. In 
this case, sufficiently low energy correlators between different 
chains will vanish, a counterpart of the exponential vanishing 
of correlators in the x-y plane in the holographic setup. 

Finally, let us consider the case of a Fermi surface. Since 
the Fermi surface is associated with a logarithmic violation of 
the area law for entanglement entropy 1481 - DOll it has long been 
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observed that the conventional MERA cannot capture the en- 
tropy scaling of a Fermi surface with a constant bond dimen- 
sion. Of course, the preceding construction does give a state 
in 3 + 1 dimensions with constant bond dimension and loga- 
rithmic scaling of the entropy, but it is highly anisotropic. A 
related construction known as branching MERA |0] can give 
Fermi surface scaling for the entanglement entropy, but it in- 
volves a proliferation of different branches of the MERA at 
low energies. To understand the physics of this situation, let 
us examine the dynamics of the Fermi surface more closely. 

The Fermi surface of a free fermion system is simply the 
boundary of the region containing occupied states in momen- 
tum space. In a Lorentz invariant system perturbed by a chem- 
ical potential, this surface would typically be a sphere S^ 1 
(in d spatial dimensions) of radius kp in momentum space. 
Now imagine probing this surface with the degrees of free- 
dom at length scale r. Because these degrees of freedom can 
only probe momentum differences 8k ~ \/r, the Fermi sur- 
face looks coarse to such probes and is effectively divided 
into (kpr)^ 1 patches. Thus we see that coarse-graining in 
real space is equivalent to fine-graining in momentum space. 
Furthermore, if the effective number of low energy degrees is 
growing as we renormalize towards the IR, we have a simple 
model of the area law violation. If each mode at scale r con- 
tributes roughly (L/r) d_1 entropy and if there are (kpr) 11 " 1 
such modes (see Fig. [TTJ, then the entropy should scale like 

S ~ [ T ( r ) " 1 ~ ^ kpL ^ 1 111 ( L / £ )- 

(3.29) 

It almost looks as if the Fermi surface system is growing a 
large extra dimension at low energies. We can formally define 
this extra dimension in terms of labels corresponding to points 
of the Fermi surface. The fermion operator tp(x, t) in the free 
case may be decomposed as 

i/)(x,t)= J dn?p n (n-x,t)e lkFn - x (3.30) 
Jfs 

analogous to the decomposition in one dimension. The fields 
ip n are the slowly varying fields of the low energy descrip- 
tion, and we would like to interpret the label n, corresponding 
to a point on a Fermi sphere, as an emergent dimension. Curi- 
ously, the short momentum cutoff is the conventional IR cutoff 
in real space. We are free to make this definition, but the cru- 
cial question is, does the emergent dimension have a notion of 
locality which is respected by the physical Hamiltonian? 

For free fermions locality is almost trivial since different 
patches don't interact, but in this case we barely have a smooth 
space at all, more like a collection of disconnected points. 
In the case of the Fermi liquid, the different directions n 
do interact, but the interactions are totally non-local. For- 
ward scattering interactions, parameterized by Landau param- 
eters, are not suppressed by distance in momentum space and 
hence arbitrarily distant points, as measured by distance on 
the Fermi surface, can interact strongly. More interesting is 
the case of strongly coupled Fermi surface gauge field sys- 
tems 1I5TI - I53I1 . In this case the gauge field can only couple 
nearby patches (and the antipodal patch) efficiently. Landau- 
type forward scattering interactions become irrelevant at the 
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FIG. 11: Fermi surface coarse-grained into patches of size Afc ~ 
1/r at scale r in d = 2. The number of patches grows in the IR and 
the Fermi surface contributes more and more to IR sensitive observ- 
ables. 

new fixed point and a new kind of locality emerges since dis- 
tant patches are only couple d by irrelevant operators. This de- 
coupling was used in Ref. 15411 to argue for L In (L) entropy 
in such phases. Thus in the strongly coupled system, the label 
n is much closer to a local coordinate than in the Fermi liq- 
uid, yet peculiarities remain, for example, the space may be 
better understood as a Z 2 orbifold since antipodal patches are 
strongly coupled. Similarly, there is a curious UV-IR duality 
at work: long distances in real space correspond to short dis- 
tances in momentum space and hence short distances in the 
putative emergent space. Hence we expect that simultaneous 
localization is not possible in both spaces, so perhaps there is 
a connection to non-commutative geometry. 

Branching MERA corresponds to the usual MERA except 
that at each RG step, b new branches are created, with b = 1 
giving the original MERA JH] . For example, consider a k — >• 
1 scheme in d dimensions with b branches created per step. 
The size of a given region shrinks after each coarse-graining 
step by k, but b new branches are also created. Hence the 
effective entropy that can be generated in a region, which is 
again proportional to the number of bonds that must be cut to 
isolate the region, is 

n=0 v ' 

Visually, as more an more branches are created, we can lit- 
erally see a new dimension (or multiple dimensions) grow- 
ing from the MERA. To obtain the Fermi surface scaling of 
entropy we must choose b = k d ^ 1 and hence the emergent 
dimension grows under the RG flow as an effectively d — 1 di- 
mensional object, exactly like the Fermi surface. Thus the 
branching MERA has a natural holographic interpretation. 
Just as some systems flow to a reduced effective dimensional- 
ity in the IR, other systems can grow new dimensions, and the 
branching MERA merely reflects this possibility. 

Correlation functions also have an interesting structure. In 
the branching MERA construction for free fermions, differ- 
ent branches correspond to different pieces of the Fermi sur- 
face. The uncorrelated nature of the branches matches the fact 
that different points on the Fermi surface are largely uncorre- 
lated. However, this is not true for Fermi liquids since the 
Landau forward scattering interactions connect every point to 
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every other point, albeit in a mild way. If we expect locality 
to reemerge in the strongly correlated system then the decou- 
pling of patches had better be reflected in the geometry. From 
the point of view of geodesies, the growth of the auxiliary 
sphere representing the Fermi surface makes it very hard for 
operators to meet deep within the network geometry. Thus 
only very nearby operators on the Fermi surface will be cor- 
related. Much remains to be done if we want to make this 
picture more precise. 

F. Black holes 

There is one additional extremely interesting geometrical 
feature that we will address. In Ref. 01 black hole-like 
objects were observed in the context of entanglement renor- 
malization at finite temperature. Strictly speaking, we can 
no longer speak purely of entanglement since the entropy no 
longer measures pure entanglement, but the idea of entangle- 
ment renormalization still makes sense with the understanding 
that we are potentially removing both local entanglement and 
local classical correlations. The density matrix of the system, 
assumed to be mixed, is written as 

p(T) = U RG (T)poUl G (T) (3.32) 

where po is a product state. When po is a completely mixed 
state we say the system supports a black hole-like object since 
the infinite local temperature at a true black hole horizon 
would produce such a completely mixed state. An alternate 
possibility, especially relevant for gapped systems or scale in- 
variant systems on compact spaces, is that the RG flow could 
stop before a completely mixed state is reached. In such a 
case the low energy description of the state is not expected to 
be strongly affected by a finite temperature. 

In most cases equal time correlations in quantum critical 
system at finite temperature decay exponentially with a corre- 
lation length set by the inverse temperature. In the context of 
finite temperature entanglement renormalization this decay is 
turned into a sharp cutoff with cutoff length set by the same 
scale. For distances small compared to 1/T (we assume z = 1 
here) two operators would merge in the network long before 
they probe the mixed state deep in the network, but once the 
two operators are separated by much more than 1/T, their 
causal cones will terminate at the final disentangled layer be- 
fore merging. In this case the operators are totally uncorre- 
cted, a sharp version of the exponential decay. Curiously, 
this again looks like the geodesic approximation in hologra- 
phy where geodesies between two boundary points can, be- 
yond a certain separation, disconnect and fall into the black 
hole horizon. 



G. Vector models 

To conclude this section we return to the question of vec- 
tor models. Since we have already said that the spectrum of 
scaling dimensions is quasi-free in the large TV limit, it fol- 
lows that the disentanglers and isometries of the MERA in this 



case should not be generic in the sense used above. As partial 
proof of this statement is found in the computation of univer- 
sal terms in the entanglement entropy in 2+1 dimensions. The 
entropy scales like S ~ aL — /3 with /3jy = N fif ree + 0(1) 
where f3f rP f is the universal term for a single non-interacting 
boson 11551 l5ql . Thus as measured by the universal proper- 
ties of entanglement, just as we anticipated above, the 0(N) 
model is relatively weakly interacting. The different degrees 
of freedom may still be mixed, but the interactions must be rel- 
atively weak to avoid a strongly mixed regime. We can show 
that this is the case within the context of MERA as follows. 
The vector model 

C=\ (W) 2 - m\$f) - \(($ff. (3.33) 

has an RG flow from the unstable A = point to the A = 
A c critical point. This flow takes an RG time of order one 
and the free fixed point has factorized disentanglers, so the 
disentanglers at the interacting fixed point are not expected to 
be strongly mixed. 

Vector models are also special in that they have a diverging 
number of primary fields as JV — > oo. Other weakly cou- 
pled theories without this property are also not expected to 
have very mixed disentanglers and coarse-grainers and hence 
should not have a sparse spectrum of operator dimensions. 
On the other hand, they may still add a more-or-less definite 
amount of entropy at each stage of the RG. The main point 
is now that we do not have the same separation of scales and 
hence locality below the scale of the MERA network is not 
clear. 

In contrast, the free limit of Af = 4 SYM in 3 + 1 dimen- 
sions is very far from the weakly coupled limit. We can be 
quite precise about the situation in this case since there is an 
entire fixed line parameterized by the 't Hooft coupling that 
runs from weak to strong coupling. We see immediately that 
strong coupling in the gauge theory is very far in parameter 
space from the weakly coupled limit. Hence we would expect 
a long RG flow from the free to the interacting theory and 
hence potentially very mixed disentanglers. 

IV. COMPARISON TO HOLOGRAPHY 

We will now make a comparison between entanglement 
renormalization at large N and holographic duality. Although 
our considerations are generally independent of dimension, 
we will often use the Af = 4 theory in 3 + 1 dimensions as 
a concrete example. We will first discuss the connection be- 
tween geometry and entanglement in the gravity theory, then 
we will turn to operator dimensions before finally discussing 
the question of bulk locality. 

The Af = 4 theory is dual to type IIB string theory in a 
spacetime that is asymptotically AdS^ x S 5 12411 . The R- 
symmetry SO(6) is nothing but the symmetry group of S 5 . 
We certainly will not get into the details of string theory, but 
we will note that the string theory has a number of different 
kinds of excitations including nearly massless fields known 
as supergravity fields (including the metric), much heavier 



string excitations, and other kinds of heavy excitations like 
D-branes. The important parameters of the string theory are 
the string length t s , the Planck length tp, and the AdS radius 
L. The AdS radius appears in the metric (r is the holographic 
direction representing length scale in the field theory) 

L 2 

ds 2 = — (dr 2 - dt 2 + dx 2 d ) (4.1) 

and sets the overall scale. The gauge theory parameters are 
related to these length scales via 




(4.2) 



(4 - 3) 

The strong coupling limit, A — s- oo, is therefore equivalent to 
the supergravity limit where most of the string modes become 
very heavy and decouple. Similarly, the large TV limit is a 
classical limit where the Planck length tends to zero and the 
theory is solvable via saddle point. 

We need a few elements of the holographic dictionary to 
proceed l24ll . First, r = e, with e a small length, is the bound- 
ary of the spacetime (the UV of the field theory). Second, 
bulk scalar fields of mass m are dual to boundary operators of 
dimension A where 

A(A- (d + 1)) = m 2 L 2 . (4.4) 

There are slight generalizations of this formula that are valid 
for other kinds of fields e.g. spinors and vectors. Very heavy 
bulk fields thus correspond to high dimension operators in 
the field theory. Third, at least for Einstein gravity, the Ryu- 
Takayanagi proposal appears to give the entanglement entropy 
in the large TV limit 11911 . This proposal says that the entropy 
of a region A in the field theory is the area in Planck units of 
the minimal bulk surface anchored to d A at r = e. 



A. Operator dimensions 
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FIG. 12: Holographic formulation of entanglement. The field theory 
region A is a strip of length R and width W. The bulk surface is 
shown schematically by the dotted lines. The entropy of the field 
theory region is the bulk area of the minimal surface anchored to dA 
in Planck units. 



and hence have a parametrically large dimension in the large 
TV and strongly coupled limit. There are also other heavy 
states like D-branes with masses related the Planck length and 
hence also with very large dimensions. 

Our considerations above do not match the precise powers 
of TV and g appearing in these formulas, but how could they 
when we haven't even used an explicit Hamiltonian? How- 
ever, we emphasize that we have understood why both the 
coupling g and the number of states TV enter. Furthermore, 
there is no obstacle to having a more general structure in the 
scaling dimensions while still adding entropy of order TV, we 
have simply obtained an extreme strong coupling limit. An 
alternative model for comparison is provided by the M-theory 
limit of string theory. In that model the string scale has disap- 
peared so only the Planck length remains. In this case we may 
not have a simple weakly coupled description in the phase di- 
agram at large TV and most operators will always have large 
dimension provided only that TV is large. 



The holographic dictionary relates the mass of bulk fields 
to the dimension of boundary operators. Very heavy fields, 
in units of the AdS radius, correspond to very high dimen- 
sion operators in the bulk. If we consider string theory in AdS 
space, there are at least three different mass scales in the the- 
ory. The lightest fields correspond to the low energy super- 
gravity modes. They typically have a mass set by the AdS 
radius L and hence correspond to operators of dimension of 
order one. In addition to these supergravity modes we have 
excited string states. These states have mass of order l/£ s 
with l s the string length. Now according to the dictionary 
above, the 't Hooft coupling in the JV = 4 is related to L and 
i s via A ~ (L/£ s ) 4 , hence dual operators to these bulk states 
have dimension 

A - L/l a ~ A 1/4 (4.5) 



B. Geometry, entanglement, and mutual information 

We already established in Ref. fli] that entanglement renor- 
malization provides a bound on the entropy of a region in 
terms of a minimal curve in the network geometry. How- 
ever, we have argued that at large TV each disentangler will 
add a definite and fixed amount of the entanglement to the 
state. Hence the bound we obtained in Ref. |Qj] should be- 
come tight and we can really identify area and entanglement. 
On the holographic side, the Ryu-Takayanagi formula relates 
entropy to area in the emergent bulk spacetime. An example 
of a holographic entropy calculation is shown in Fig. [T2lwhere 
the region A is a strip of length R and width W. 

Of course, within the context of holographic duality, the 
RT formula is not the end of the story. Even if it is correct 
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at large N and A (it has been proven for special geometries, 
see Ref. ll29lo . it certainly receives corrections at finite N and 
A. Sometimes the corrections take the form of some kind of 
modified area formula, but more generally we don't know the 
answer and have little reason to suspect a connection to the 
area. It is true that the higher derivative corrections to black 
hole entropy, as captured by the Wald formula, will continue 
to be proportional to the black hole area by construction, this 
formula cannot be applied more generally since bulk holo- 
graphic entanglement surfaces as not as simple as black hole 
horizons 1 . As far as entanglement renormalization is con- 
cerned, it appears to be a bulk area law in every case, at least 
at the level of the coarse network geometry. The smooth ge- 
ometry which emerges in the large N limit must come partly 
from the large N tensors that make up the network. 

The structure of holographic mutual information is also 
qualitatively similar to that obtained in entanglement renor- 
malization. In both cases we have a relatively sudden tran- 
sition from a large order N mutual information to a much 
smaller power decaying mutual information mediated by the 
few low dimension operators in the spectrum. Our assump- 
tion that the disentanglers are rapidly mixing and hence add a 
large and definite amount of entropy allows us to give an en- 
tanglement renormalization proof of the monogamy of mutual 
information discussed in Ref. 15711 to leading order in large N. 
Furthermore, the naive extension of entanglement renormal- 
ization to time dependent states should have bearing on the 
still-open question of monogamy and holographic entangle- 
ment in time-dependent backgrounds. Furthermore, the fact 
that within entanglement renormalization the large N part of 
the entropy is always related to some kind area is very sugges- 
tive. Perhaps in general we should minimize some functional 
of the area to obtain the entanglement entropy holographically 
at large N. The holographic evidence for this suggestions is 
very limited (see Ref. II57I1 for a brief discussion), so perhaps 
entanglement renormalization is giving us a strong clue here. 
There are still issues, however, since naive proposals, e.g. to 
use the Wald entropy on non-black hole horizons, appear to 
fail. 



C. Bulk locality 

Finally, let us discuss bulk locality. We have consistently 
identified the emergent holographic direction with length 
scale in the field theory, but this is only half the full story. 
More specifically, it appears that this kind of identification is 
truly generic, and the structure of entanglement renormaliza- 
tion nicely captures this structure even at small N. On the 
other hand, there are multiple length scales in the bulk, and 
we can ask about locality with respect to all of them. 

In the Af = 4 theory, there are at least three relevant lengths 
in the holographic dual, namely the AdS radius L, the string 
length t e , and the Planck length lp. Locality on the scale of 
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the AdS radius is related to the coarse RG locality present in 
the MERA network even at small N. At large N each piece of 
the MERA network adds a large amount of entropy, of order 
N, and hence corresponds on the holographic side to a large 
area in Planck units, in fact a bulk area of order L d . This 
makes sense since we identify N ~ (L/£p) 3 (d = 3) as the 
number of degrees of freedom in the Af = 4 theory. Thus to 
see bulk locality on the scale of £ s or £p we must look within 
the tensors of the MERA network. Indeed, while the large 
N entanglement entropy of the vector model was N times 
the free result, the large N entanglement of the matrix model 
does not have this property in general. Thus the tensors of the 
tensor network, which control this quantity, are indeed more 
local in the matrix model since they are effectively sensitive 
to the metric in Planck units. 

Remarkably, we already have partial evidence that there is 
an emergent local space within these tensors. This evidence 
comes from the spectrum of operator dimensions which we 
argued was sparse. It was shown in Ref. ||20ll , by considering 
a kind of bulk scattering experiment, that just such a sparse 
spectrum was necessary for bulk locality beyond the AdS ra- 
dius. Locality at the scale of the AdS radius was called coarse 
holography while locality at shorter scales was called sharp 
holography. To us it looks like the MERA naturally provides a 
coarse holographic dual for any theory, where as a sharp holo- 
graphic dual only emerges in the large N and strongly inter- 
acting limits. In contrast, the higher spin theories which have 
been suggested to be dual to vector models (specifically, the 
singlet sector) contain fields of low dimension and arbitrar- 
ily high spin. Based on our considerations here, these fields 
could lead to non-local behavior up to the scale of the AdS ra- 
dius, but beyond that they should become effectively local to 
be consistent with very general and coarse features of renor- 
malization. 

It is intriguing to think of the unitaries and isometries as 
being related to the physics of large N quantum mechanical 
systems. In these systems we know that that geometry can 
emerge from the quantum mechanics of a few large matrices 
lf58ll . This is speculation, but we suspect that such models may 
be used to explain in more detail the origin of a large space 
within a given disentangler in the appropriate strong coupling 
limit. 



V. CONTINUOUS MERA AND HOLOGRAPHY 

We now recast our considerations in the context of the con- 
tinuous MERA introduced in Ref. J3l ■ The continuous MERA 
automatically preserves translation invariance and works di- 
rectly in the continuum but otherwise shares many features 
with the discrete case. A major drawback of the continuous 
MERA is that it is not as computationally useful as the discrete 
MERA. The primary advantage of the continuous MERA will 
be the simple way in which it enables us to define the bulk ge- 
ometry. Continuous MERA may thus be a fruitful midway 
point between full-scale holography and the computational 
power of the discrete MERA. 

As originally formulated, each infinitesimal RG step in- 
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volves two operations, a disentangling operation and a scaling 
operation designed to effectively change the number of de- 
grees of freedom. Both operations are represented by unitary 
operators, so we can without trouble follow the RG flow in 
either direction. Let u denote the RG time starting from the 
UV and increasing towards the IR. u is related to our earlier 
length scale r by r = e u e. Deep in the IR, assumed to be un- 
entangled, we introduce entanglement using a transformation 



then 



g — iK{u)8u 

and then rescale the state using 

„—iD Su 



(5.1) 



(5.2) 



K is an entanglement generating operation and Dq was, in the 
original work, chosen to be a free dilatation generator. Both 
are local operators and are taken to vanish at high energies 
i.e. they only involve fields below some cutoff. The choice of 
Do presupposes a nearby free fixed point, so more generally 
we would expect to combine K and Dq into the full dilata- 
tion generator D at a fixed point. If the system undergoes an 
RG flow then one will have a crossover phenomena where the 
structure of the dilatation generator changes. Nevertheless, 
this is simply a variational ansatz, with variational parameters 
K and Dq, so we need not worry if a precise identification in 
terms of scaling generators away from fixed points is lacking. 

If \IR) denotes the unentangled IR state then our ansatz for 
the UV state is 



with 



\UV) = V\IR) (5.3) 



du'(K(u') + Dq(u')) (5.4) 



V = Vu[ exp 



and where V u denotes anti-path ordering in u. The conven- 
tions here are a little peculiar, but what we are doing is in- 
tegrating from the IR (large u) to the UV (small u) and we 
want the IR parts to hit \IR) first. If the system is near a 
critical point then we should be able to replace K + Dq with 
a u-independent D, the dilatation generator. In this case we 
would have V = e~ lUirtD except for some modifications in 
the deep UV or IR. 

Local operators inserted in the UV theory at u — can 
be moved to different values of u by following the dynamics 
generated by D. Thus we have 

OuvWV) 

= O uv e- luD e- i(uiR - u)D \IR) 

= e- mD O u e-^ UlR - u)D \IR), (5.5) 
where O u is simply 

u = e Uuve ■ (5.6) 
Now if O = Ojjv is a scaling operator satisfying 

[D, 0{x)] = iAO + ixd x O{x) (5.7) 



O u = e- uA 0(xe- u ). 



(5.8) 



Note how the inverse transformation is applied to the argu- 
ment of the field. 

If we now insert two scaling operators a distance x apart 
at u = 0, they are brought to within e of each other when 
xe~ u = e. Each operator will then have been renormalized 
by a factor of 



(5.9) 



and since the expectation value of Ouy{e)Ouv (0) is inde- 
pendent of x, it follows that 



(O uv (x)Ouv(P)) 



1 

„2A 



(5.10) 



The discussion of the spectrum of operator dimensions is 
even more straightforward in the continuous MERA. The scal- 
ing operator D, which we suppose is simply a UV cutoff 
of the continuum dilatation generator (which is anyway not 
scheme independent), obviously has the properties we want. 
It is translation invariant, it has a small number of low dimen- 
sion states, and most of the states have very large dimension. 
Translation invariance requires some explanation. The dilata- 
tion generator is not itself translation invariant as follows from 



Eq. lXTOl We have e iaP De~ 



iaP 



D + ia[P, D} = D + aP, 



but this result is sensible since we have to choose an origin for 
D. The crucial point is then that if we apply D to a translation 
invariant state with P\ip) = then the resulting state is also 
translation invariant PD\ip) = (DP — iP)\tp) = 0. 

The other properties of the dilatation generator are evident 
from the fact that the operator D is the Hamiltonian of the 
CFT on a sphere. The small number of low dimension oper- 
ators correspond to the small energies of the sphere Hamil- 
tonian while the large number of high dimension operators 
correspond to a proliferation of excited states at high energy 
JlH. On the holographic side we know the structure of the 
theory on a sphere from the dual gravitational theory in global 
AdS. For example, the Hawking-Page transition demonstrates 
the distinction between the low and high energy parts of the 
spectrum. As always, the details depend on the model, but 
the fact that D is the integral of a local operator means that 
our mixing assumption in the discrete setting above is sensi- 
ble. Indeed, the scaling superoperator in the lattice MERA is 
nothing but a regulated version of the unitary flow generated 
by D. 



A. Entanglement, geometry, and RG causality 

An important question we need to address is the causality 
properties of the evolution generated by D. Looking again 
at the motion of a scaling operator under D, we see that it 



changes position exponentially fast, O u (x) 



-An 



0{xe~ 



Thus the dynamics generated by D do not obey a conventional 
speed limit. Indeed, looking at the form of D we see that it is 
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explicitly position dependent, so we can understand its proper- 
ties in terms of a position dependent maximum velocity. This 
should be compared to standard Lieb-Robinson bounds show- 
ing that all dynamical effects in a lattice model must obey a 
universal limit, with the speed set by the size of the Hamil- 
tonian. Thus if the velocity were proportional to v ~ x we 
would indeed have the observed exponential dynamics since 
dx/du = v = x implies x(u) = x(0)e u . So there is a notion 
of causality, but with a position dependent maximum veloc- 
ity. However, we emphasize that the proof in, e.g. Ref. ll59ll . 
fails when we add a linearly growing Hamiltonian (basically 
an extra factorial is generated), so our argument for an expo- 
nentially increasing RG light cone is still heuristic. 

Nevertheless, the physical picture is compelling. As op- 
erators evolve under the dynamics generated by D they can 
spread out exponentially fast but not faster. This picture com- 
pares favorably with the conventional MERA where operators 
on distant sites also move together exponentially fast. Thus 
it seems that like the discrete case we have a notion of RG 
causality in continuous MERA. For example, a perturbation 
added at a single point should not be able to affect the state 
outside of the exponentially increasing RG causal cone. We 
can compare this with the proposals in Refs. [Ril l45ll which 
also tried to identify sub-regions of the bulk geometry that 
correspond to definite sub-regions in the field theory. In the 
context of holographic boundary conformal field theory, the 
geometry far from the impurity is also untouched in rough 
agreement with our considerations here ll43ll . 

Now, under general conditions, the rate of entropy increase 
of a region A under time evolution generated by a local Hamil- 
tonian is 



dS(A) 
du 



\8A\ 



(5.11) 



where 7 is the entanglement generation rate per degree of free- 
dom. However, it is critical to understand that 7 depends 
on the magnitude of the Hamiltonian, and this magnitude is 
growing when we consider dynamics generated by the dilata- 
tion operator. Thus if 7 for a region A of size R is effectively 
7 ~ R, all the above bound tells us is that 



dS(A) 
du 



<\A\ 



(5.12) 



which is rather weak. Nevertheless, Ref. J3] assumed that this 
bound held with a 7 of order one and then gave a scaling argu- 
ment, equivalent to our earlier argument, showing that an area 
law emerges naturally from the RG perspective. The argument 
is as follows. We expect the effective size of A to shrink under 
the RG time evolution, so the entropy per unit RG time it is 



d-l 



du. 



(5.13) 



d = 1 CFTs 01 l32ll . We start with an unentangled state and 
then evolve it in time using the dilatation generator. This sce- 
nario is remarkably like that considered in Ref. l60ll which in- 
vestigated the growth of entanglement after a quantum quench 
into a CFT from a short-range correlated state. Ref. ll60Tl 
found that the entropy of an interval grew at a constant rate 
given by 



dS_ 
~dt 



TIC 

6e 



(5.14) 



with eo some cutoff. In our case the dilatation operator is di- 
mensionless, so if we make the heuristic replacement dt/eo — > 
du then we see that each stage of the RG adds roughly c to the 
entropy. Since this holds for any CFT1+1 it confirms our ear- 
lier argument for constant entropy per RG step even at small 
N and illustrates that at large N the spectrum of the CFT can 
be mostly unrelated to the precise amount of entropy added. 

There is actually an easy way to obtain a more complete 
result in our case. We introduce twist fields *f> n that can be 
used to compute tr(p n ) within an n-copy replica theory. To be 
concrete, the correlator (^ n (0)^ n (L)) n in the n-copy theory 
is 



(* n (0)* n (L)) r 



tr 



P[om 



(5.15) 



These operators are non-local since they create branch cuts in 
the n-copy theory, yet they behave like conformal primaries 



of dimension A* = ^ (n - i) iLfl]. Note that MERA 



can handle non-local scaling operators 16211 . Now these op- 
erators have the peculiar property that if we take their ex- 
pectation value in an unentangled pure state, then we get 
($ n (0)^ n (L)) n = 1 independent of L. This is because 

tr ^pjj, = 1 in such an unentangled pure state. 

Our task is then to compute the correlator of twist fields in 



the state lit) 



-iuD n 



\IR) n in the n-copy theory. Since 



^f n is a scaling operator we can use Eq. 15.81 to renormalize 
If u < In (L/e) then the twist fields will be renormalized 
to the IR before merging, so we obtain the correlator 

(«|*„(0)*„(L)|«)„ 

= (IR\e- uA *V n {0)e- uA *y n {Le- u )\IR) n 



-2AmU 



(5.16) 



The entropy of the region L in this state is thus 



1 



i( 1 + ^ M 



(5.17) 



When we integrate dS from u = to u = In (R/e) we obtain 
the area law when d > 1. 

How much entropy is the RG circuit actually adding at ev- 
ery step? In general, we have called this entropy the entan- 
glement per scale and we identified it as the central charge in 



Taking the n — > 1 limit we have S = S\ = cu/'i and hence 
dS/du = c/3. This growth with u is cutoff beyond u = 
In (L/e) since the twist fields merge before making it to the 
IR scale. Thus going from \u) to \u + du) does indeed add 
(c/3)du entropy to an interval. 
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More generally, the above scaling argument and CFT calcu- 
lations can only take us so far. They exclude the possibility of 
a number of low energy degrees of freedom that grows with 
coarse-graining. For example, in the case of the Fermi sur- 
face it is true that the region shrinks exponentially fast with u, 
but the number of low energy modes also grows exponentially 
fast with u in precisely the same way. These two effects then 
cancel so that 



dS(A) 



d-l 



(k F e u e) d - 1 du, 



(5.18) 



when integrated, gives a logarithmic correction to the area law. 
We also see that in principle even more severe violations of the 
area law are still consistent with the structure of the evolution 
generated by the dilatation operator. 

With this in mind, let us turn to the interesting proposal in 
Ref. Jg] to define a holographic metric based on the contin- 
uous MERA. They take the overlap between states that dif- 
fer by infinitesimal RG times to define the holographic radial 
component of the metric. They showed based on their def- 
inition that sensible-looking scale invariant geometries were 
produced. One objection is that the large N scaling isn't quite 
right in their proposal. This is because the differential overlap 
should go like N but in the holographic context one expects 
a different dimension dependent power of N to appear in the 
metric so that the area in Planck units is ultimately propor- 
tional to N. However, this is a minor issue. Nevertheless, a 
possible refinement of the proposal in Ref. J61] more in line 
with the original suggestion of Ref. |Q]] would be to define 
local areas in the bulk in terms of differential entanglement 
generated. 

Roughly speaking, we can break the generator of the RG 
time evolution up into different local pieces. Each piece acting 
for an infinitesimal RG time would then give a little piece of 
bulk area corresponding to d — 1 field theory directions and 
the emergent holographic direction. A more elegant approach 
is provided by the following construction. The infinitesimal 
contribution to the area of a minimal surface anchored at the 
boundary of A is defined to be proportional to the change in 
entropy dS(A) where 



dS(A) = S(A, u + du)- S{A, u) 



(5.19) 



where S(A, u) is the entropy of A in the state including de- 
grees of freedom from scales < u. Then the bulk metric would 
be recovered as an inverse problem in terms of these entropies. 
The resulting bulk spatial metric (r = e u e) can be parameter- 
ized in terms of two functions as 



as MERA 



f(r)dr 2 + g{r)dx 2 



(5.20) 



and the key problem is to show that the entropy definition is 
consistent with a geometric computation depending only on 
these two functions. 

We can also use these considerations to address the question 
of Fermi surface-like entanglement in a holographic setting. 
If, as we argued above, the Fermi surface actually gives an 
equivalent contribution to the entropy at every scale (the de- 
creasing size of dA and increasing size of the Fermi surface 



cancel), then the bulk area element should be 

dAbuik 



dr 

dUdu dT, — 
r 



(5.21) 



where g?£ is an infinitesimal element of dA. One can sim- 
ply carry out the continuous MERA optimization procedure 
for a two dimensional Fermi gas to verify this scaling. The 
dilatation generator ends up being a sum of one dimensional 
dilatation generators for every point of the Fermi surface. 

Setting g(r) = r~ 2 as usual, the area element from the 
metric ds 2 MERA is 



dA bu i k = — V f(r)dr 



(5.22) 



which requires yff = r d 2 . Writing the metric in a standard 
form in the putative Fermi surface case we thus find 



L 2 

MERA = 



r 2 ^dr 2 + dx 2 ) 



(5.23) 



Remarkably, just this metric has recently been obtained in 
the holographic context as a candidate to describe systems 
with emergent Fermi surfaces J63i l64ll . The considerations 
on shape dep endence in Ref. Il64ll combined with the results 
of Ref. B65I1 show that the leading part of entanglement en- 
tropy of a Fermi liquid can be consistently interpreted as the 
area in a higher dimension geometry. 

As a technical note, more generally we should take into ac- 
count the changing shape of dA, for example, in d = 1 if the 
boundary of an interval at scale r is x(r) the area element is 



dA bu ik = -i/l 



' dr 
r 



(5.24) 



assuming f = g = r~ 2 . This leads to a slight refinement of 
our formula above, but this is irrelevant in the Fermi surface 
case. Note that our argument is also consistent with a metric 
that includes an extra d—1 dimensional sphere as 



ds' 



L 2 



dx„ 



{k,Fr) 2 ds 2 sd - 



(5.25) 



In this geometry a minimal surface including all of the sphere 
will have an area element of 



(5.26) 



which also integrates to a logarithmic term. This should be 
compared to the branching MERA story above. 

Of course, a very similar argument was used to guess the 
metric in Eq. I5.23l in a purely holographic context (the metric 
in Eq. |5.25| is more novel). The advantages of our proposal are 
severalfold. First, we know by definition that area and geom- 
etry are related, while on the holographic side one can ques- 
tion the Ryu-Takayanagi formula. Second, we know exactly 
what system we have in mind, an unusual luxury in holog- 
raphy. Third, we have a precise field theory construction of 
a putative holographic state. However, we should emphasize 
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that this is a preliminary proposal, and much additional work 
needs to be done. 

Finally, we cannot help but mention a field theory that po- 
tentially combines large N physics with that of a Fermi sur- 
face. This theory is a gauge theory with fermions in the adjoint 
representation and carrying a conserved charge. Unlike the 
vector-like fermion matter content in Refs. |5U 15311 . this the- 
ory has a sensible large N limit with controlled corrections. 
By adding a chemical potential for the conserved charge, it 
becomes possible that this system realizes non-Fermi liquid 
physics over a wide range of energy scales even if it is eventu- 
ally unstable. Similar models in one dimension have recently 
been been shown to exhibit interesting features in Ref. l66ll . 

B. Black holes 

We can also generalize our considerations about black holes 
to the continuous case. Consider a thermal state (not normal- 
ized) of some conformal field theory given by 

p(T) = e~ 0H . (5.27) 

We can formally consider the transformed state 

e luD p{T)e~ luD = e~^ UH = p(Te u ), (5.28) 

where we say "formally" because the trace of p is changed 
so e zuD is not really a unitary operator in this space. Nev- 
ertheless, a regulated version of this statement is precisely 
the content of Eq. 12.301 Thus starting from some nearly to- 
tally thermalized (nearly infinite temperature) state in the IR 
P o = e~P° H with /?o ~ we can form the state at a sensible 
finite temperature by writing 

p(T) = e~ 0H = e- luD p e luD (5.29) 

with u = ln(/3//3o). In particular, the entanglement geom- 
etry of this quantum circuit is completely identical for short 
distances (distances short compared to 1/T) to the geometry 
of the ground state. 

The entropy of a region in the field theory will now receive 
two contributions, one from the entanglement part of the ge- 
ometry and one from the thermalized degrees of freedom in 
the deep IR. Furthermore, the entropy will exhibit an entan- 
glement to thermal crossover as a function of region size and 
temperature. Finally, note that the state in Eq. IV Bl is only 
schematic since the unitary e~ vuD is only the approximate RG 
circuit. In general we have 

p{T) = V(T)p V^(T) (5.30) 

with V(T) given by e~ luD for most of the RG time but with 
corrections near the deep UV and IR. Correlations in this state 
also follow the general scheme outlined above. For operators 
nearer than roughly 1/T, the two-point function is roughly 
that of the zero temperature system. On the other hand, for 
distances greater than 1/T the dilatation dynamics does not 
bring the two operators together before they run into the de- 
coupled state in the IR. 



Let us also compare the total RG flow time. In our entan- 
glement renormalization proposal the total RG time from UV 
cutoff to zero IR entanglement is u = In ((3/(3q). We convert 
f3 to a length rr = v/T using a velocity v ~ J/e (previously 
set to one) with J is the energy scale of the cutoff Hamilto- 
nian. Taking j3o ~ 1/ J we get an RG time of u ~ In (rr / e)- 
On the holographic side, the black hole metric, in the planar 
AdSs case, is 

ds2 = (jJ7) dr2 ~ /(r) ^ 2 + ^ ■ (5J1) 

The function / is f(r) = 1 — J" 4 / r fi where r/, is the horizon 
position which scales as 1/T. Computing the distance from a 
UV cutoff r = e to the horizon r = rj, ~ we find 

fds=f —^=dr ~ In (r T /e) (5.32) 
in agreement with the entanglement renormalization picture. 

VI. DISCUSSION 

In this paper we have considered the problem of entan- 
glement renormalization in strongly coupled large N theo- 
ries. We argued that two particular features of these models 
relevant for holography, namely a sparse spectrum of oper- 
ator dimensions and the emergence of a classical geometry 
built from entanglement, emerge naturally from entanglement 
renormalization. We also addressed many other features of 
entanglement renormalization and holography, including gen- 
eralizations to include Fermi surfaces, the detailed structure of 
mutual information and correlations, and the generalization to 
continuous MERA. 

Our work has also inspired us to think about several issues 
in holography proper from a potentially unique direction. One 
interesting direction is to consider the role of wormholes in 
holo grap hy. A classic theorem known as topological censor- 
ship 16711 forbids the existence of wormholes connecting dif- 
ferent asymptotically AdS spaces when the null energy condi- 
tion is satisfied. We began thinking about wormholes by ask- 
ing the question, "If spacetime is entanglement, then is there 
anything entanglement can't be used to do?" One simple an- 
swer is that entanglement cannot be used to signal in the ab- 
sence of classical correlations. Ultimately, this is just a trivial 
consequence of local dynamics, however, we feel the entan- 
glement perspective is useful. Thus while spacetime may be 
composed of entanglement, it should not be possible to use 
entanglement alone to communicate between different asymp- 
totic boundaries. 

Furthermore, since the null energy condition is not expected 
to be fundamental, e.g. it is violated by quantum and stringy 
effects, a general principle for a sensible low energy grav- 
ity action should be that traversable wormholes are forbid- 
den. Now in the context of higher spin theories some worm- 
hole geometries have actually been written down, however, 
it is crucial that in these models the metric is badly gauge 
non-invariant. Thus the wormhole may or may not exist in a 
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given gauge. The challenge here is to formulate the correct 
notion locality and show that one cannot communicate from 
one boundary to another. In theories for which we understand 
the correct notion of locality, we should simply forbid matter 
or modifications to gravity that permit wormholes. 

It would also be very desirable to formulate our consid- 
erations in a more covariant fashion to further bridge the 
gulf between entanglement renormalization and holography. 
Such a covariant formulation would enable us to address ex- 
citing dynamical questions including quantum quenches and 
thermalization. Of course, we can simply do entanglement 
renormalization on the time dependent quantum state, but this 
seems inelegant since time and space are treated very dif- 
ferently. We do know something about dynamics since the 
MERA encodes the scaling dimension of the Hamiltonian 
(and hence effectively the dt 2 term in the metric). Further- 
more, using the Heisenberg picture we can compute various 
time dependent correlation functions starting from an initial 
state given by MERA, but we would also like to make sense 
of the Schrodinger picture where one might think of the ge- 
ometry/tensor network as evolving. Related to these consider- 
ations, we would like to better understand the relationship be- 
tween entanglement renormalization and other forms of renor- 
malization typically formulated in terms of the partition func- 
tion or path integral. 

Another important question involves symmetry. We have 
not focused on symmetry in this work, but it is known how 
to include symmetry in the tensor network description l68ll . 
This is done with the aid of representation theory and Pen- 
rose's spin networks (see Ref. 1690 f° r a gentle introduction). 
Those familiar with lattice gauge theory will immediately rec- 
ognize that the appropriate interpretation of these diagrams is 
in terms of electric flux ll70l ? ] . Thus by tracking the flow of 
representations down through the MERA we are naturally led 
to think in terms of emergent electric field lines in the bulk. 
This is nothing but the usual statement that a boundary global 



symmetry is dual to a bulk gauge field. It would be very in- 
teresting to develop this connection further and, for example, 
look for analogues of Luttinger's theorem lp7~TTl in entangle- 
ment renormalization. There are also intriguing connections 
between holographic duality and spin networks in loop quan- 
tum gravity e.g. Ref. £72] which have hardly been explored. 

Our discussion here also leads to another natural numerical 
direction. One can look for critical theories not be minimiz- 
ing the ground state of some local Hamiltonian but instead 
by searching for MERA networks that give rise to translation 
invariant states. The idea is that imposing symmetry plus the 
right structure of entanglement should permit us to write down 
large classes of critical states without ever needing a parent 
Hamiltonian. This is work in progress. Also, it is not incon- 
ceivable that one could just do entanglement renormalization, 
say in one dimension, for a lattice model that realizes some 
strongly coupled large N matrix model. This would be really 
exciting. We could really fully test holographic duality since 
we would have the solution on both sides of the duality. 

Building on the work of Ref. HI and using the large N 
limit, we have made a strong case for a non-trivial connec- 
tion between entanglement renormalization and holography. 
Of course, the connection could basically stop here, although 
we think otherwise. Nevertheless, the situation is positive in 
any event, if these two beautiful ideas are actually the same 
thing, great, otherwise they both contain a huge number of 
unexpected generic features of quantum many-body systems. 
Either way our understanding of quantum matter is vastly en- 
hanced. 
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